In this aim, we model the observed data by a two-type Galton-Watson process
consistent with the binary tree structure of the data. Under independence
between the process leading to the missing data and the BAR process
and suitable assumptions on the driven noise, we establish the strong con-

1. Introduction

Bifurcating autoregressive processes (BAR) generalize autoregressive (AR) pro- 
cesses, when the data have a binary tree structure. Typically, they are involved 
in modeling cell lineage data, since each cell in one generation gives birth to two 
offspring in the next one. Cell lineage data usually consist of observations of 
some quantitative characteristic of the cells, over several generations descended 
from an initial cell. BAR processes take into account both inherited and
environmental effects to explain the evolution of the quantitative characteristic
under study. They were first introduced by Cowan and Staudte [4]. In their
paper, the original BAR process was defined as follows. The initial cell is labelled
1, and the two offspring of cell $k$ are labelled $2k$ and $2k+1$. If $X_k$ denotes the
quantitative characteristic of individual $k$, then the first-order BAR process is
given, for all $k \geq 1$, by

$$
\begin{cases}
X_{2k} = a + bX_k + \varepsilon_{2k}, \\
X_{2k+1} = a + bX_k + \varepsilon_{2k+1}.
\end{cases}
$$

The noise sequence $(\varepsilon_{2k}, \varepsilon_{2k+1})$ represents environmental effects, while $a, b$ are
unknown real parameters, with $|b| < 1$, related to the inherited effects. The
driven noise $(\varepsilon_{2k}, \varepsilon_{2k+1})$ was originally supposed to be independent and
identically distributed with normal distribution. But since two sister cells are in the
same environment at their birth, $\varepsilon_{2k}$ and $\varepsilon_{2k+1}$ are allowed to be correlated,
inducing a correlation between sister cells, distinct from the correlation inherited
from their mother.

Recently, experiments made by biologists on aging of Escherichia coli [15]
motivated mathematical and statistical studies of the asymmetric BAR process,
that is when the quantitative characteristics of the even and odd sisters are
allowed to depend on their mother's through different sets of parameters $(a, b)$,
see Equation (2.1) below. In [9, 8], Guyon proposes an interpretation of the
asymmetric BAR process as a bifurcating Markov chain, which allows him to
derive laws of large numbers and central limit theorems for the least squares
estimators of the unknown parameters of the process. This Markov chain
approach was further developed by Bansaye [2] in the context of cell division with
parasite infection, and by Delmas and Marsalle [5], where the cells are allowed
to die. Another approach based on martingale theory was proposed by Bercu,
de Saporta and Gegout-Petit [3], to sharpen the asymptotic analysis of Guyon
under weaker assumptions.

The originality of this paper is that we take into account possibly missing 
data in the estimation procedure of the parameters of the asymmetric BAR 
process, see Figure 1 for an example. This is a problem of practical interest, 
as experimental data are often incomplete, either because some cells died, or 
because the measurement of the characteristic under study was impossible or 
faulty. For instance, among the 94 colonies dividing up to 9 times studied in
[15], about 47% of the data are missing on average. It is important to take
this phenomenon into account in the model for a rigorous statistical study.

Missing data in bifurcating processes were first modeled by Delmas and
Marsalle [5]. They defined the genealogy of the cells through a Galton-Watson
process, but they took into account the possible asymmetry problem only by
differentiating the reproduction laws according to the daughter's type (even or
odd). The bifurcating process was thus still a Markov chain. However,
considering the biological issue of aging in E. coli naturally leads us to introduce the
possibility that two cells of different types may not have the same reproduction
law. In this paper, we thus introduce a two-type Galton-Watson process to
model the genealogy, and lose the Markovian structure of the bifurcating chain,
so that we cannot use the same approach as [5]. Instead, we use the martingale
approach introduced in [3]. It must be pointed out that missing data are not
dealt with in [3], so that we cannot directly use their results either. In particular,
the observation process is another source of randomness that requires stronger
moment assumptions on the driven noise of the BAR process and a careful choice
between various filtrations. In addition, the normalizing terms are now random
and the convergences are only available on the random non-extinction set of the
observed process.

Figure 1. A tree associated with the bifurcating auto-regressive process up to the 4th
generation. The dashed cells are not observed.

The naive approach to handle missing data would be to replace the sums over
all data in the estimators by sums over the observed data only. Our approach is
slightly more subtle, as we distinguish whether a cell has even or odd daughters.
We propose a joint model where the structure for the observed data is based
on a two-type Galton-Watson process consistent with the possibly asymmetric
structure of the BAR process. See e.g. [12, 1, 10] for a presentation of multi-type
Galton-Watson processes and general branching processes. Note also that our
estimation procedure does not require prior knowledge of the parameters
of the two-type Galton-Watson process.

This paper is organized as follows. In Section 2, we first introduce our BAR
model as well as related notation, then we define and recall results on the
two-type Galton-Watson process used to model the observation process. In Section 3,
we give the least-squares estimator for the parameters of the observed BAR process
and we state our main results on the convergence and asymptotic normality of
our estimators as well as estimation results on data. The proofs are detailed in
the following sections.

2. Joint model

We now introduce our joint model, starting with the asymmetric BAR process 
for the variables of interest. 

2.1. Bifurcating autoregressive processes 

On the probability space $(\Omega, \mathcal{A}, \mathbb{P})$, we consider the first-order asymmetric BAR
process given, for all $k \geq 1$, by

$$
\begin{cases}
X_{2k} = a + bX_k + \varepsilon_{2k}, \\
X_{2k+1} = c + dX_k + \varepsilon_{2k+1}.
\end{cases}
\qquad (2.1)
$$

The initial state $X_1$ is the characteristic of the ancestor, while $(\varepsilon_{2k}, \varepsilon_{2k+1})$ is the
driven noise of the process. In all the sequel, we shall assume that $\mathbb{E}[X_1^8] < \infty$.
Moreover, as in the previous literature, the parameters $(a, b, c, d)$ belong to $\mathbb{R}^4$
with

$$0 < \max(|b|, |d|) < 1.$$

This assumption ensures the stability (non-explosion) of the BAR process. As
explained in the introduction, one can see this BAR process as a first-order
autoregressive process on a binary tree, where each vertex represents an individual
or cell, vertex 1 being the original ancestor, see Figure 2 for an illustration. We
use the same notation as in [3]. For all $n \geq 1$, denote the $n$-th generation by
$\mathbb{G}_n = \{2^n, 2^n+1, \ldots, 2^{n+1}-1\}$. In particular, $\mathbb{G}_0 = \{1\}$ is the initial
generation, and $\mathbb{G}_1 = \{2, 3\}$ is the first generation of offspring from the first ancestor.

Figure 2. The tree associated with the bifurcating auto-regressive process.

Let $r_k$ be the generation of individual $k$, which means that $r_k = [\log_2(k)]$,
where $[x]$ denotes the largest integer less than or equal to $x$. Recall that the
two offspring of individual $k$ are labelled $2k$ and $2k+1$, or conversely, the
mother of individual $k$ is $[k/2]$. More generally, the ancestors of individual $k$
are $[k/2], [k/2^2], \ldots, [k/2^{r_k}]$. Denote by $\mathbb{T}_n = \bigcup_{\ell=0}^{n} \mathbb{G}_\ell$ the sub-tree of all
individuals from the original individual up to the $n$-th generation. Note that the
cardinality $|\mathbb{G}_n|$ of $\mathbb{G}_n$ is $2^n$, while that of $\mathbb{T}_n$ is $|\mathbb{T}_n| = 2^{n+1} - 1$. Next, $\mathbb{T}$ denotes
the complete tree, so to speak $\mathbb{T} = \bigcup_{n \geq 0} \mathbb{G}_n = \bigcup_{n \geq 0} \mathbb{T}_n = \mathbb{N}^* = \mathbb{N} \setminus \{0\}$. Finally,
we need to distinguish the individuals in $\mathbb{G}_n$ and $\mathbb{T}_n$ according to their type.
Since we are dealing with the types even and odd, that we will also label 0 and
1, we set

$$
\mathbb{G}_n^0 = \mathbb{G}_n \cap (2\mathbb{N}), \quad \mathbb{G}_n^1 = \mathbb{G}_n \cap (2\mathbb{N}+1), \quad \mathbb{T}_n^0 = \mathbb{T}_n \cap (2\mathbb{N}),
$$
$$
\mathbb{T}_n^1 = \mathbb{T}_n \cap (2\mathbb{N}+1), \quad \mathbb{T}^0 = \mathbb{T} \cap (2\mathbb{N}) \quad \text{and} \quad \mathbb{T}^1 = \mathbb{T} \cap (2\mathbb{N}+1). \qquad (2.2)
$$
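The labelling conventions above translate directly into integer arithmetic. The following is a minimal sketch; the function names are illustrative and are not taken from the paper.

```python
def generation(k):
    """Generation r_k = floor(log2(k)) of individual k >= 1."""
    return k.bit_length() - 1


def mother(k):
    """Label [k/2] of the mother of individual k >= 2."""
    return k // 2


def ancestors(k):
    """Ancestors [k/2], [k/4], ..., 1 of individual k, nearest first."""
    out = []
    while k > 1:
        k //= 2
        out.append(k)
    return out


def generation_set(n):
    """G_n = {2^n, ..., 2^(n+1) - 1}, the n-th generation."""
    return range(2 ** n, 2 ** (n + 1))


def subtree(n):
    """T_n = G_0 u ... u G_n = {1, ..., 2^(n+1) - 1}."""
    return range(1, 2 ** (n + 1))


def cell_type(k):
    """Type 0 for even labels, type 1 for odd labels."""
    return k % 2
```

For instance, `ancestors(13)` walks up the lineage 13, 6, 3, 1, and `len(subtree(n))` equals $2^{n+1} - 1$.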

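We now state our assumptions on the noise sequence. Denote by $\mathbb{F} = (\mathcal{F}_n)$
the natural filtration associated with the first-order BAR process, which means
that $\mathcal{F}_n$ is the $\sigma$-algebra generated by all individuals up to the $n$-th generation,
$\mathcal{F}_n = \sigma\{X_k, k \in \mathbb{T}_n\}$. In all the sequel, we shall make use of the following
moment and independence hypotheses.

(HN.1) For all $n \geq 0$ and for all $k \in \mathbb{G}_{n+1}$, $\varepsilon_k$ belongs to $L^8$. Moreover, there
exist $(\sigma^2, \tau^4, \kappa^8) \in (0, +\infty)^3$, $(|\rho'|, \nu^2, \lambda^4) \in [0, 1)^3$ such that:

• $\forall n \geq 0$ and $k \in \mathbb{G}_{n+1}$,

$$
\mathbb{E}[\varepsilon_k | \mathcal{F}_n] = 0, \quad \mathbb{E}[\varepsilon_k^2 | \mathcal{F}_n] = \sigma^2, \quad \mathbb{E}[\varepsilon_k^4 | \mathcal{F}_n] = \tau^4, \quad \mathbb{E}[\varepsilon_k^8 | \mathcal{F}_n] = \kappa^8 \quad \text{a.s.}
$$

• $\forall n \geq 0$, $\forall k \neq l \in \mathbb{G}_{n+1}$ with $[k/2] = [l/2]$,

$$
\mathbb{E}[\varepsilon_k \varepsilon_l | \mathcal{F}_n] = \rho = \rho' \sigma^2, \quad \mathbb{E}[\varepsilon_k^2 \varepsilon_l^2 | \mathcal{F}_n] = \nu^2 \tau^4, \quad \mathbb{E}[\varepsilon_k^4 \varepsilon_l^4 | \mathcal{F}_n] = \lambda^4 \kappa^8 \quad \text{a.s.}
$$

(HN.2) For all $n \geq 0$ the random vectors $\{(\varepsilon_{2k}, \varepsilon_{2k+1}), k \in \mathbb{G}_n\}$ are
conditionally independent given $\mathcal{F}_n$.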

2.2. Observation process 

We now turn to the modeling of the observation process. The observation process 
is intended to encode if a datum is missing or not. The natural property it has 
thus to satisfy is the following: if the datum is missing for some individual, it is 
also missing for all its descendants. Indeed, the datum may be missing because
of the death of the individual, or because the individual is the last of its lineage
at the end of the data gathering, see Figure 3 for an example of a partially
observed tree.

Figure 3. The tree associated with the observed data of the tree in Figure 1.

2.2.1. Definition of the observation process

Mathematically, we define the observation process, $(\delta_k)_{k \in \mathbb{T}}$, as follows. We set
$\delta_1 = 1$ and define recursively the sequence through the following equalities:

$$\delta_{2k} = \delta_k \zeta_k^0 \quad \text{and} \quad \delta_{2k+1} = \delta_k \zeta_k^1, \qquad (2.3)$$

where $(\zeta_k = (\zeta_k^0, \zeta_k^1))$ is a sequence of independent random vectors of $\{0, 1\}^2$,
$\zeta_k^i$ standing for the number (0 or 1) of descendants of type $i$ of individual $k$.
The sequences $(\zeta_k, k \in 2\mathbb{N}^*)$ and $(\zeta_k, k \in 2\mathbb{N}+1)$ are sequences of identically
distributed random vectors. We specify the common laws of these two sequences
using their generating functions, $f^{(0)}$ and $f^{(1)}$ respectively:

$$
f^{(0)}(s_0, s_1) = p^{(0)}(0,0) + p^{(0)}(1,0) s_0 + p^{(0)}(0,1) s_1 + p^{(0)}(1,1) s_0 s_1,
$$
$$
f^{(1)}(s_0, s_1) = p^{(1)}(0,0) + p^{(1)}(1,0) s_0 + p^{(1)}(0,1) s_1 + p^{(1)}(1,1) s_0 s_1,
$$

where $p^{(i)}(j_0, j_1)$ is the probability that an individual of type $i$ gives birth to $j_0$
descendants of type 0, and $j_1$ of type 1. The sequence $(\delta_k)$ is thus completely
defined. We also assume that the observation process is independent from the
BAR process.

(HI) The sequences $(\delta_k)$ and $(\zeta_k)$ are independent from the sequences $(X_k)$
and $(\varepsilon_k)$.

Remark that, since both $\zeta_k^0$ and $\zeta_k^1$ take values in $\{0, 1\}$ for all $k$, the observation
process $(\delta_k)$ is itself taking values in $\{0, 1\}$. Finally, Equation (2.3) ensures that
if $\delta_k = 0$ for some $k \geq 2$, then for all its descendants $j$, $\delta_j = 0$. In relation with
the observation process $(\delta_k)$, we introduce two filtrations: $\mathcal{Z}_n = \sigma\{\zeta_k, k \in \mathbb{T}_n\}$,
$\mathcal{O}_n = \sigma\{\delta_k, k \in \mathbb{T}_n\}$, and the sigma field $\mathcal{O} = \sigma\{\delta_k, k \in \mathbb{T}\}$. Notice that
$\mathcal{O}_{n+1} \subset \mathcal{Z}_n$. We also define the sets of observed individuals as follows:

$$\mathbb{G}_n^* = \{k \in \mathbb{G}_n : \delta_k = 1\} \quad \text{and} \quad \mathbb{T}_n^* = \{k \in \mathbb{T}_n : \delta_k = 1\}.$$

Finally, let $\mathcal{E}$ be the event corresponding to the cases when there is no
individual left to observe. More precisely,

$$\mathcal{E} = \bigcup_{n \geq 1} \{|\mathbb{G}_n^*| = 0\}. \qquad (2.4)$$

We will denote $\overline{\mathcal{E}}$ the complementary set of $\mathcal{E}$.
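The recursion (2.3) can be simulated in a few lines. The sketch below is only an illustration: the function name, the dictionary encoding of the reproduction laws, and the numerical probabilities are assumptions made here, not quantities from the paper.

```python
import random


def simulate_observation(n_gen, law_even, law_odd, seed=0):
    """Simulate delta_k for all k in T_n (labels 1 .. 2^(n_gen+1) - 1).

    law_even and law_odd are hypothetical encodings of p^(0) and p^(1):
    dictionaries mapping (j0, j1) in {0,1}^2 to a probability.
    """
    rng = random.Random(seed)

    def draw(law):
        outcomes = list(law.keys())
        weights = list(law.values())
        return rng.choices(outcomes, weights=weights)[0]

    delta = {1: 1}  # the ancestor is always observed
    for k in range(1, 2 ** n_gen):  # k runs over T_{n_gen - 1}
        law = law_even if k % 2 == 0 else law_odd
        zeta0, zeta1 = draw(law)  # zeta_k = (zeta_k^0, zeta_k^1)
        delta[2 * k] = delta[k] * zeta0
        delta[2 * k + 1] = delta[k] * zeta1
    return delta


# Illustrative reproduction law (values chosen arbitrarily for the example).
example_law = {(0, 0): 0.1, (1, 0): 0.1, (0, 1): 0.1, (1, 1): 0.7}
delta = simulate_observation(6, example_law, example_law, seed=1)
```

By construction a missing individual has only missing descendants, which is the hereditary property required of the observation process.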
2.2.2. Results on the observation process 

Let us introduce some additional notation. For $n \geq 1$, we define the number
of observed individuals among the $n$-th generation, distinguishing according to
their types:

$$Z_n^0 = |\mathbb{G}_n^* \cap 2\mathbb{N}| \quad \text{and} \quad Z_n^1 = |\mathbb{G}_n^* \cap (2\mathbb{N}+1)|, \qquad (2.5)$$

and we set, for all $n \geq 1$, $Z_n = (Z_n^0, Z_n^1)$. Note that for $i \in \{0, 1\}$ and $n \geq 1$ one
has

$$Z_n^i = \sum_{k \in \mathbb{G}_{n-1}} \delta_{2k+i}.$$

One has $\mathbb{G}_0^* = \mathbb{G}_0 = \{1\}$, but, even if 1 is odd, the individual whose lineage we
study may as well be of type 0 as of type 1. Consequently, we will work with
possibly two different initial laws: $\mathbb{P}(\cdot | Z_0 = e_i)$, for $i \in \{0, 1\}$, where $e_0 = (1, 0)$
and $e_1 = (0, 1)$. The process $(Z_n, n \geq 0)$ is thus a two-type Galton-Watson
process, and all the results we are giving in this section mainly come from [12].
Notice that the law of $\zeta_k$, for even $k$, is the law of reproduction of an individual
of type 0, the first component of $\zeta_k$ giving the number of children of type 0, the
second the number of children of type 1. The same holds for $\zeta_k$, with odd $k$,
mutatis mutandis. Since these vectors take values in $\{0, 1\}^2$, moments of all orders
exist for these reproduction laws, and we can thus define the descendants matrix $P$

$$
P = \begin{pmatrix} p_{00} & p_{01} \\ p_{10} & p_{11} \end{pmatrix},
$$

where $p_{i0} = p^{(i)}(1,0) + p^{(i)}(1,1)$ and $p_{i1} = p^{(i)}(0,1) + p^{(i)}(1,1)$, for $i \in \{0, 1\}$. The
quantity $p_{ij} = \mathbb{E}[\zeta_{2+i}^j]$ is thus the expected number of descendants of type $j$ of an
individual of type $i$. We also introduce the variance of the laws of reproduction:
$\sigma_{ij}^2 = \mathbb{E}[(\zeta_{2+i}^j - p_{ij})^2]$, for $(i, j) \in \{0, 1\}^2$. Note that $\sigma_{ij}^2 = p_{ij}(1 - p_{ij})$. It is
well-known (see e.g. Theorem 5.1 of [12]) that when all the entries of the matrix
$P$ are positive, $P$ has a positive strictly dominant eigenvalue, denoted $\pi$, which
is also simple. We make the following main assumptions on the matrix $P$.

(HO) All entries of the matrix $P$ are positive: for all $(i, j) \in \{0, 1\}^2$, $p_{ij} > 0$,
and the dominant eigenvalue is greater than one: $\pi > 1$.

Hence, still following Theorem 5.1 of [12], we know that there exist left and
right eigenvectors for $\pi$ which are positive, in the sense that each component
of the vector is positive. We call $y = (y^0, y^1)^t$ such a right eigenvector, and
$z = (z^0, z^1)$ such a left one; without loss of generality, we choose $z$ such that
$z^0 + z^1 = 1$. Regarding the two-type Galton-Watson process $(Z_n)$, $\pi$ plays the
same role as the expected number of offspring, in the case of standard
Galton-Watson processes. In particular, $\pi$ is related to the extinction of the process,
where the set of extinction of $(Z_n)$ is defined as $\bigcup_{n \geq 1} \{Z_n = (0, 0)\}$. Notice that
$\{Z_n = (0, 0)\} = \{Z_n^0 + Z_n^1 = 0\} = \{|\mathbb{G}_n^*| = 0\}$, so that this set coincides with $\mathcal{E}$,
defined by Eq. (2.4). Now let $q = (q^0, q^1)$, where, for $i \in \{0, 1\}$,

$$q^i = \mathbb{P}(\mathcal{E} | Z_0 = e_i).$$

The probability $q^i$ is thus the extinction probability if initially there is one
individual of type $i$. These two probabilities allow us to compute the extinction
probability under any initial distribution, since $\mathbb{P}(\mathcal{E}) = \mathbb{E}[(q^0)^{Z_0^0} (q^1)^{Z_0^1}]$ thanks
to the branching property. Hypothesis (HO) means that the Galton-Watson
process $(Z_n)$ is super-critical, and ensures that $0 \leq q^i < 1$, for both $i = 0$ and
$i = 1$. This immediately yields

$$\mathbb{P}(\mathcal{E}) < 1. \qquad (2.6)$$
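To make the role of $P$, $\pi$ and $q$ concrete, the following sketch computes them for given reproduction laws. The dictionary encoding of $p^{(i)}$ and the numerical values are assumptions made for this example. The extinction probabilities are obtained by iterating the generating functions from 0, the standard construction of the smallest fixed point of $(f^{(0)}, f^{(1)})$ for multi-type Galton-Watson processes.

```python
import math


def descendants_matrix(law_even, law_odd):
    """Rows (p_i0, p_i1) of P, from laws given as {(j0, j1): probability}."""
    rows = []
    for law in (law_even, law_odd):
        p_i0 = law[(1, 0)] + law[(1, 1)]  # expected number of type-0 children
        p_i1 = law[(0, 1)] + law[(1, 1)]  # expected number of type-1 children
        rows.append((p_i0, p_i1))
    return rows


def dominant_eigenvalue(P):
    """Largest eigenvalue of a 2x2 matrix with nonnegative entries."""
    (a, b), (c, d) = P
    trace, det = a + d, a * d - b * c
    # The discriminant equals (a - d)^2 + 4bc, hence is nonnegative here.
    return (trace + math.sqrt(trace * trace - 4.0 * det)) / 2.0


def extinction_probabilities(law_even, law_odd, n_iter=10000):
    """Approximate (q^0, q^1) by iterating (f^(0), f^(1)) starting from 0."""

    def f(law, s0, s1):
        return (law[(0, 0)] + law[(1, 0)] * s0
                + law[(0, 1)] * s1 + law[(1, 1)] * s0 * s1)

    q0 = q1 = 0.0
    for _ in range(n_iter):
        q0, q1 = f(law_even, q0, q1), f(law_odd, q0, q1)
    return q0, q1


# Illustrative law, identical for both types (values chosen arbitrarily).
law = {(0, 0): 0.1, (1, 0): 0.1, (0, 1): 0.1, (1, 1): 0.7}
P = descendants_matrix(law, law)
pi_hat = dominant_eigenvalue(P)
q0, q1 = extinction_probabilities(law, law)
```

With this particular law every entry of $P$ equals 0.8, so $\pi = 1.6 > 1$ and (HO) holds; the fixed-point equation $q = 0.1 + 0.2q + 0.7q^2$ has smallest root $1/7$, which the iteration recovers.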

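Under that condition, we also have the existence of a non-negative random
variable $W$ such that for any initial distribution of $Z_0$

$$
\lim_{n \to \infty} \frac{Z_n}{\pi^n} = \lim_{n \to \infty} \frac{1}{\pi^n} \sum_{i=0}^{1} y^i Z_n^i \, z = W z \quad \text{a.s.} \qquad (2.7)
$$

It is well-known that $\{W = 0\} = \mathcal{E}$ a.s., so that the set $\{W > 0\}$ can be
viewed as the set of non-extinction $\overline{\mathcal{E}}$ of $(Z_n)$, up to a negligible set. These
results give the asymptotic behavior of the number of observed individuals,
since $|\mathbb{G}_n^*| = Z_n^0 + Z_n^1$ and $|\mathbb{T}_n^*| = \sum_{\ell=0}^{n} (Z_\ell^0 + Z_\ell^1)$:

$$
\lim_{n \to \infty} \frac{|\mathbb{G}_n^*|}{\pi^n} = \lim_{n \to \infty} \frac{\pi - 1}{\pi} \frac{|\mathbb{T}_n^*|}{\pi^n} = W \quad \text{a.s.}
$$

Roughly speaking, this means that $\pi^n$ is a deterministic equivalent of $|\mathbb{T}_n^*|$ and
Eq. (2.7) implies that $z^i$ is the asymptotic proportion of cells of type $i$ in a given
generation. We will thus very often replace $|\mathbb{T}_n^*|$ by $\pi^n$ for computations, and
the next lemma will be used frequently to replace $\pi^n$ by $|\mathbb{T}_n^*|$.

Lemma 2.1 Under assumption (HO), we have

$$
\lim_{n \to +\infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{\pi^n}{|\mathbb{T}_n^*|} = \frac{\pi - 1}{\pi} \frac{1}{W} \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}
$$

2.3. Joint model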



The model under study in this paper is therefore the observed BAR process
defined by

$$
\begin{cases}
\delta_{2k} X_{2k} = \delta_{2k} (a + bX_k + \varepsilon_{2k}), \\
\delta_{2k+1} X_{2k+1} = \delta_{2k+1} (c + dX_k + \varepsilon_{2k+1}).
\end{cases}
$$

The aim of this paper is to study the sharp asymptotic properties of the
least-squares estimators of the parameters $(a, b, c, d)$ and the variance matrix of the
noise process.



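3. Least-squares estimation

Our goal is to estimate $\theta = (a, b, c, d)^t$ from the observed individuals up to the
$n$-th generation, that is the observed sub-tree $\mathbb{T}_n^*$.

3.1. Definition of the estimators

We propose to make use of the standard least-squares (LS) estimator $\hat{\theta}_n$ which
minimizes

$$
\sum_{k \in \mathbb{T}_{n-1}} \delta_{2k} (X_{2k} - a - bX_k)^2 + \delta_{2k+1} (X_{2k+1} - c - dX_k)^2.
$$

Consequently, we obviously have for all $n \geq 1$

$$
\hat{\theta}_n = \begin{pmatrix} \hat{a}_n \\ \hat{b}_n \\ \hat{c}_n \\ \hat{d}_n \end{pmatrix}
= \boldsymbol{\Sigma}_{n-1}^{-1} \sum_{k \in \mathbb{T}_{n-1}}
\begin{pmatrix} \delta_{2k} X_{2k} \\ \delta_{2k} X_k X_{2k} \\ \delta_{2k+1} X_{2k+1} \\ \delta_{2k+1} X_k X_{2k+1} \end{pmatrix}, \qquad (3.1)
$$

where, for all $n \geq 0$,

$$
\boldsymbol{\Sigma}_n = \begin{pmatrix} S_n^0 & 0 \\ 0 & S_n^1 \end{pmatrix}
\quad \text{and} \quad
S_n^i = \sum_{k \in \mathbb{T}_n} \delta_{2k+i} \begin{pmatrix} 1 & X_k \\ X_k & X_k^2 \end{pmatrix},
$$

for $i \in \{0, 1\}$. In order to avoid intricate invertibility assumptions, we shall
assume, without loss of generality, that for all $n \geq 0$, $\boldsymbol{\Sigma}_n$ is invertible. Otherwise,
we only have to add the identity matrix $I_4$ to $\boldsymbol{\Sigma}_n$, as Proposition 4.2 states that
the normalized limit of $\boldsymbol{\Sigma}_n$ is positive definite.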
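Since the normalizing matrix is block diagonal, the estimator splits into two ordinary 2x2 least-squares problems, one over the observed even daughters and one over the observed odd daughters. The sketch below illustrates this under stated assumptions: the function names are hypothetical, the data are stored in dictionaries indexed by cell label, and the simulated noise is independent Gaussian, which is only one admissible special case of the noise hypotheses.

```python
import random


def simulate_bar(n, a, b, c, d, sigma=0.1, x1=1.0, seed=0):
    """Simulate X_k for k in T_n with independent Gaussian noise (illustrative)."""
    rng = random.Random(seed)
    X = {1: x1}
    for k in range(1, 2 ** n):  # k in T_{n-1}
        X[2 * k] = a + b * X[k] + rng.gauss(0.0, sigma)
        X[2 * k + 1] = c + d * X[k] + rng.gauss(0.0, sigma)
    return X


def ls_estimator(X, delta, n):
    """Least-squares estimate of (a, b, c, d) from the cells observed in T_n.

    For i = 0 (even daughters) and i = 1 (odd daughters) this solves
    S^i (intercept, slope)^t = sum_k delta_{2k+i} (X_{2k+i}, X_k X_{2k+i})^t.
    """
    theta = []
    for i in (0, 1):
        s1 = sx = sxx = sy = sxy = 0.0
        for k in range(1, 2 ** n):  # k in T_{n-1}
            child = 2 * k + i
            if delta.get(child, 0) == 1:  # an observed child has an observed mother
                x, y = X[k], X[child]
                s1 += 1.0
                sx += x
                sxx += x * x
                sy += y
                sxy += x * y
        det = s1 * sxx - sx * sx
        if det == 0.0:
            raise ValueError("normalizing matrix S^%d is singular" % i)
        intercept = (sxx * sy - sx * sxy) / det
        slope = (s1 * sxy - sx * sy) / det
        theta.extend([intercept, slope])
    return tuple(theta)
```

A convenient sanity check is the noise-free case: with `sigma=0.0` the estimator should return the true parameters up to rounding error, whatever the pattern of missing cells, as long as both 2x2 systems are non-singular.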

Remark 3.1 Note that when all data are observed, that is when all $\delta_k$ equal
1, this is simply the least squares estimator described in the previous literature.
However, one must be careful here with the indices in the normalizing matrix,
as there are now two different matrices $S^0$ and $S^1$, while there was only one in
the fully observed problem. The intuitive way to deal with missing data would be
to restrict the sums to the observed data only. Note that our estimator is more
complex as it involves sums depending on the absence or presence of even- or
odd-type daughters of the available data.

We now turn to the estimation of the parameters $\sigma^2$ and $\rho$. We propose to
estimate the conditional variance $\sigma^2$ and the conditional covariance $\rho$ by

$$\ldots \ \hat{\varepsilon}_{2k} \hat{\varepsilon}_{2k+1},$$

where for all $k \in \mathbb{G}_n$,

$$
\begin{cases}
\hat{\varepsilon}_{2k} = \delta_{2k} (X_{2k} - \hat{a}_n - \hat{b}_n X_k), \\
\hat{\varepsilon}_{2k+1} = \delta_{2k+1} (X_{2k+1} - \hat{c}_n - \hat{d}_n X_k),
\end{cases}
$$

and

$$\mathbb{T}_n^{*01} = \{k \in \mathbb{T}_n : \delta_{2k} \delta_{2k+1} = 1\},$$

so to speak $\mathbb{T}_{n-1}^{*01}$ is the set of the cells of the tree $\mathbb{T}_{n-1}$ which have exactly two
offspring.



3.2. Main results 

We can now state the sharp convergence results we obtain for the estimators 
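above. We introduce additional notation. For $i \in \{0, 1\}$, let us denote:



where $z = (z^0, z^1)$ is the left eigenvector for the dominant eigenvalue $\pi$ of
the descendants matrix $P$ introduced in section 2.2.2, $h^i$, $k^i$ are defined in
Propositions 6.3 and 6.5 and the four terms of $L^{0,1}$ are defined in Proposition 6.6.
We also define the $4 \times 4$ matrices

$$
\boldsymbol{\Sigma} = \begin{pmatrix} L^0 & 0 \\ 0 & L^1 \end{pmatrix}, \qquad
\Gamma = \begin{pmatrix} \sigma^2 L^0 & \rho L^{0,1} \\ \rho L^{0,1} & \sigma^2 L^1 \end{pmatrix}. \qquad (3.2)
$$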

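Our first result deals with the strong consistency of the LS estimator $\hat{\theta}_n$.

Theorem 3.2 Under assumptions (HN.1), (HN.2), (HO) and (HI), $\hat{\theta}_n$
converges to $\theta$ almost surely on $\overline{\mathcal{E}}$ with the rate of convergence

$$
\mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \|\hat{\theta}_n - \theta\|^2 = O\left( \frac{\log |\mathbb{T}_{n-1}^*|}{|\mathbb{T}_{n-1}^*|} \right) \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.} \qquad (3.3)
$$

In addition, we also have the quadratic strong law

$$
\lim_{n \to \infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{1}{n} \sum_{\ell=1}^{n} |\mathbb{T}_{\ell-1}^*| (\hat{\theta}_\ell - \theta)^t \boldsymbol{\Sigma} (\hat{\theta}_\ell - \theta) = 4 \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.} \qquad (3.4)
$$

Our second result is devoted to the almost sure asymptotic properties of the
variance and covariance estimators $\hat{\sigma}_n^2$ and $\hat{\rho}_n$. Let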



Theorem 3.3 Under assumptions (HN.1), (HN.2), (HO) and (HI), $\hat{\sigma}_n^2$
converges almost surely to $\sigma^2$ on $\overline{\mathcal{E}}$. More precisely, one has

lim l/|G.|>o}- (52/c+j(£2fc+i - e2/c+j)^ = 4(7r - l)fT^l£ a.s. (3.5) 

feGT„_i i=0 
IT* I 

lim l{|G.|>o} — (J??.-a2)=4(7r-l)a2% a.s. (3.6) 

In addition, $\hat{\rho}_n$ converges almost surely to $\rho$ on $\overline{\mathcal{E}}$ and one has

lim l{|G*|>o}— ^2fc(£2fc — e2fe)'52fc+i(e2fe+i — £2fc+i) 

fcGT„_i 

= 4p^ir((ii)-i/2i0,i(^0)-i/2)j_ ^..^ (3 7) 

lim l{|G.|>o}^(p„-p«)=p4^ir((ii)-i(L°^i)2(iO)-i)l^ 

(3.8) 

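Our third result concerns the asymptotic normality for all our estimators $\hat{\theta}_n$, $\hat{\sigma}_n^2$
and $\hat{\rho}_n$ given the non-extinction of the underlying Galton-Watson process. For
this, using the fact that $\mathbb{P}(\overline{\mathcal{E}}) \neq 0$ thanks to Eq. (2.6), we define the probability
$\mathbb{P}_{\overline{\mathcal{E}}}$ by

$$
\mathbb{P}_{\overline{\mathcal{E}}}(A) = \frac{\mathbb{P}(A \cap \overline{\mathcal{E}})}{\mathbb{P}(\overline{\mathcal{E}})} \quad \text{for all } A \in \mathcal{A}.
$$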



Theorem 3.4 Under assumptions (HN.1), (HN.2), (HO) and (HI), we have
the central limit theorem

$$
\sqrt{|\mathbb{T}_{n-1}^*|} \, (\hat{\theta}_n - \theta) \xrightarrow{\mathcal{L}} \mathcal{N}(0, \boldsymbol{\Sigma}^{-1} \Gamma \boldsymbol{\Sigma}^{-1}) \quad \text{on } (\overline{\mathcal{E}}, \mathbb{P}_{\overline{\mathcal{E}}}). \qquad (3.9)
$$

In addition, we also have

(3.10)

where p(1, 1) is defined in Eq. (6.6) and

The proof of our main results is going to be detailed in the next sections.
It is based on martingale properties, and we will exhibit our main martingale
$(M_n)$ in Section 4. Sections 5 to 7 are devoted to proving the sharp asymptotic
properties of $(M_n)$. Finally, in Section 8 we prove our main results. Before
turning to the definition of the martingale $(M_n)$, we present a short application
of our estimation procedure on data.

3.3. Results on real data 

The biological issue addressed by Stewart et al. in [15] is aging in the single cell 
organism Escherichia coli, see also [7] for further biological details. E. coli is a 
rod-shaped bacterium that reproduces by dividing in the middle. Each cell has 
thus a new end (or pole), and an older one. The cell that inherits the old pole 
of its mother is called the old pole cell, the cell that inherits the new pole of 
its mother is called the new pole cell. Therefore, each cell has a type: old pole 
(even) or new pole (odd) cell, inducing asymmetry in the cell division. 

Stewart et al. filmed colonies of dividing cells, determining the complete lin- 
eage and the growth rate of each cell. Their statistical study of the averaged 
genealogy and pair-wise comparison of sister cells showed that the old pole cells 
exhibit cumulatively slowed growth, less offspring biomass production and an 
increased probability of death. Note that their test assumes independence be- 
tween the averaged pairs of sister cells which is not verified in the lineage. 

Another analysis was proposed in [9]. There, the growth rate is modeled by a
Markovian bifurcating process, allowing single-experiment statistical analysis
instead of averaging all the genealogical trees. Asymptotic properties of a more
general asymmetric Markovian bifurcating autoregressive process are then
investigated in [8], where a Wald's type test is rigorously constructed to study
the asymmetry of the process. These results cannot be compared to ours
because this model does not take into account the possibly missing data from the
genealogies, and it is not clear how the author manages them, as not a single
tree from the data of [15] is complete. In [5], the authors take missing data into
account but, contrary to our approach, they allow different sets of parameters
for cells with two, one or no offspring, making the direct comparison with our
estimator again impossible.

We have applied our methodology on the set of data penna-2002-10-04-4
from the experiments of [15]. It is the largest data set of the experiment. It
contains 663 cells up to generation 9 (note that there would be 1023 cells in a
full tree up to generation 9). In particular, we have performed

• point estimation of the vector $\theta$,

• interval estimation for the coefficients $(a, b, c, d)$,

• Wald's type symmetry tests for the entries of $\theta$.

parameter     a                     c
estimation    0.03627               0.03058
C.I.          [0.03276; 0.03979]    [0.02696; 0.03420]

parameter     b                     d
estimation    0.02662               0.17055
C.I.          [-0.06866; 0.12191]   [0.07247; 0.26863]

Table 1. Estimation on the data set penna-2002-10-04-4

Table 1 gives the estimation $\hat{\theta}_9$ of $\theta$ with the 95% Confidence Interval (C.I.)
of each coefficient. The variance given by the CLT for $\theta$ in Eq. (3.9) is
approximated by $\boldsymbol{\Sigma}_n^{-1} \Gamma_n \boldsymbol{\Sigma}_n^{-1}$ thanks to the convergence given in Corollary 4.3.
The confidence intervals of $b$ and $d$ show that the non-explosion assumption
($|b| < 1$ and $|d| < 1$) is satisfied. Some empirical computation on the process
$(\delta_k)$ gives the following estimation for the highest eigenvalue of the
Galton-Watson process: $\pi = 1.35669$ (with confidence interval [1.27979, 1.43361], see
[14]), also satisfying the super-criticality assumption. Wald tests of comparison
between the coefficients of $\theta$ have been deduced from the CLT. The null hypotheses
$(a, b) = (c, d)$ (resp. $a = c$, $b = d$) are rejected with p-values p = 0.0211 (resp.
p = 0.0158 and p = 0.0244). Hence on this data set the cell division is indeed
statistically asymmetric.



4. Martingale approach 

To establish all the asymptotic properties of our estimators, we shall make 
use of a martingale approach, similar to [3]. However, their results cannot be 
used in our framework, since the randomness comes now not only from the state 
process, but also from the time space (genealogy). These two mixed randomness 
sources require careful choice between various filtrations, and stronger moment 
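assumptions on the driven noise of the BAR process. For all $n \geq 1$, denote

$$
M_n = \sum_{k \in \mathbb{T}_{n-1}} \left( \delta_{2k} \varepsilon_{2k}, \ \delta_{2k} X_k \varepsilon_{2k}, \ \delta_{2k+1} \varepsilon_{2k+1}, \ \delta_{2k+1} X_k \varepsilon_{2k+1} \right)^t.
$$

Thus, for all $n \geq 2$, we readily deduce from Equations (3.1) and (2.1) that

$$
\hat{\theta}_n - \theta = \boldsymbol{\Sigma}_{n-1}^{-1} \sum_{k \in \mathbb{T}_{n-1}}
\begin{pmatrix} \delta_{2k} \varepsilon_{2k} \\ \delta_{2k} X_k \varepsilon_{2k} \\ \delta_{2k+1} \varepsilon_{2k+1} \\ \delta_{2k+1} X_k \varepsilon_{2k+1} \end{pmatrix}
= \boldsymbol{\Sigma}_{n-1}^{-1} M_n. \qquad (4.1)
$$

The key point of our approach is that $(M_n)$ is a martingale for a well chosen
filtration.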
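The step from (3.1) and (2.1) to (4.1) can be spelled out as follows. This is a sketch of the computation, with the shorthand $(\alpha_0, \beta_0) = (a, b)$ and $(\alpha_1, \beta_1) = (c, d)$ introduced here for convenience, and with $S_{n-1}^i$ denoting the $2 \times 2$ blocks of the normalizing matrix $\boldsymbol{\Sigma}_{n-1}$ of Section 3.1.

```latex
% For i in {0,1}, multiply the observed BAR equation
%   \delta_{2k+i} X_{2k+i} = \delta_{2k+i}(\alpha_i + \beta_i X_k + \varepsilon_{2k+i})
% by (1, X_k)^t and sum over k in T_{n-1}:
\sum_{k \in \mathbb{T}_{n-1}} \delta_{2k+i}
  \begin{pmatrix} X_{2k+i} \\ X_k X_{2k+i} \end{pmatrix}
= \underbrace{\sum_{k \in \mathbb{T}_{n-1}} \delta_{2k+i}
  \begin{pmatrix} 1 & X_k \\ X_k & X_k^2 \end{pmatrix}}_{S_{n-1}^i}
  \begin{pmatrix} \alpha_i \\ \beta_i \end{pmatrix}
+ \sum_{k \in \mathbb{T}_{n-1}} \delta_{2k+i}\, \varepsilon_{2k+i}
  \begin{pmatrix} 1 \\ X_k \end{pmatrix}.
% Stacking the cases i = 0 and i = 1, the right-hand side of (3.1) reads
%   \Sigma_{n-1} \theta + M_n ,
% so that multiplying by \Sigma_{n-1}^{-1} gives
\hat{\theta}_n = \theta + \boldsymbol{\Sigma}_{n-1}^{-1} M_n .
```

The block-diagonal form of the normalizing matrix is what allows the even and odd equations to be treated separately before stacking.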



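4.1. Martingale property

Recall that $\mathcal{O} = \sigma\{\delta_k, k \in \mathbb{T}\}$ is the $\sigma$-field generated by the observation process.
We shall assume that all the history of the process $(\delta_k)$ is known at time 0 and
use the filtration $\mathbb{F}^{\mathcal{O}} = (\mathcal{F}_n^{\mathcal{O}})$ defined for all $n$ by

$$
\mathcal{F}_n^{\mathcal{O}} = \mathcal{O} \vee \sigma\{\delta_k X_k, k \in \mathbb{T}_n\} = \mathcal{O} \vee \sigma\{X_k, k \in \mathbb{T}_n^*\},
$$

where $\mathcal{F} \vee \mathcal{G}$ denotes the $\sigma$-field generated by both $\mathcal{F}$ and $\mathcal{G}$. Note that for all
$n$, $\mathcal{F}_n^{\mathcal{O}}$ is a sub $\sigma$-field of $\mathcal{O} \vee \mathcal{F}_n$.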

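Proposition 4.1 Under assumptions (HN.1), (HN.2) and (HI), the process
$(M_n)$ is a square integrable $\mathbb{F}^{\mathcal{O}}$-martingale with increasing process given, for all
$n \geq 1$, by

$$
\langle M \rangle_n = \Gamma_{n-1} = \begin{pmatrix} \sigma^2 S_{n-1}^0 & \rho S_{n-1}^{0,1} \\ \rho S_{n-1}^{0,1} & \sigma^2 S_{n-1}^1 \end{pmatrix},
$$

where $S_n^0$ and $S_n^1$ are defined in section 3.1 and

$$
S_n^{0,1} = \sum_{k \in \mathbb{T}_n} \delta_{2k} \delta_{2k+1} \begin{pmatrix} 1 & X_k \\ X_k & X_k^2 \end{pmatrix}.
$$

Proof: First, notice that for all $n \geq 1$, one has

$$
\Delta M_n = M_n - M_{n-1} = \sum_{k \in \mathbb{G}_{n-1}}
\begin{pmatrix} \delta_{2k} \varepsilon_{2k} \\ \delta_{2k} X_k \varepsilon_{2k} \\ \delta_{2k+1} \varepsilon_{2k+1} \\ \delta_{2k+1} X_k \varepsilon_{2k+1} \end{pmatrix}.
$$

Now, we use the fact that for all $n$, $\mathcal{F}_n^{\mathcal{O}}$ is a sub-$\sigma$-field of $\mathcal{O} \vee \mathcal{F}_n$, the
independence between $\mathcal{O}$ and $\mathcal{F}_n$ under assumption (HI) and the moment hypothesis
(HN.1) to obtain

$$
\mathbb{E}[\delta_{2k} \varepsilon_{2k} | \mathcal{F}_{n-1}^{\mathcal{O}}]
= \delta_{2k} \mathbb{E}\big[ \mathbb{E}[\varepsilon_{2k} | \mathcal{O} \vee \mathcal{F}_{n-1}] \,\big|\, \mathcal{F}_{n-1}^{\mathcal{O}} \big]
= \delta_{2k} \mathbb{E}\big[ \mathbb{E}[\varepsilon_{2k} | \mathcal{F}_{n-1}] \,\big|\, \mathcal{F}_{n-1}^{\mathcal{O}} \big] = 0.
$$

We obtain similar results for the other entries of $\Delta M_n$ as $\delta_{2k+1}$ and $X_k$ are
also $\mathcal{F}_{n-1}^{\mathcal{O}}$-measurable. Hence, $(M_n)$ is a $\mathbb{F}^{\mathcal{O}}$-martingale. It is clearly square
integrable from assumption (HN.1). The same measurability arguments together
with assumption (HN.2) yield

$$
\mathbb{E}[\Delta M_n (\Delta M_n)^t | \mathcal{F}_{n-1}^{\mathcal{O}}] = \sum_{k \in \mathbb{G}_{n-1}}
\begin{pmatrix}
\sigma^2 \delta_{2k} & \sigma^2 \delta_{2k} X_k & \rho \delta_{2k} \delta_{2k+1} & \rho \delta_{2k} \delta_{2k+1} X_k \\
\sigma^2 \delta_{2k} X_k & \sigma^2 \delta_{2k} X_k^2 & \rho \delta_{2k} \delta_{2k+1} X_k & \rho \delta_{2k} \delta_{2k+1} X_k^2 \\
\rho \delta_{2k} \delta_{2k+1} & \rho \delta_{2k} \delta_{2k+1} X_k & \sigma^2 \delta_{2k+1} & \sigma^2 \delta_{2k+1} X_k \\
\rho \delta_{2k} \delta_{2k+1} X_k & \rho \delta_{2k} \delta_{2k+1} X_k^2 & \sigma^2 \delta_{2k+1} X_k & \sigma^2 \delta_{2k+1} X_k^2
\end{pmatrix}.
$$

Hence the result as $\langle M \rangle_n = \sum_{\ell=1}^{n} \mathbb{E}[\Delta M_\ell (\Delta M_\ell)^t | \mathcal{F}_{\ell-1}^{\mathcal{O}}]$. □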



Our main results are direct consequences of the sharp asymptotic properties
of the martingale $(M_n)$. In particular, we will extensively use the strong law of
large numbers for locally square integrable real martingales given in Theorem
1.3.15 of [6]. Throughout this paper, we shall also use other auxiliary
martingales, either with respect to the same filtration $\mathbb{F}^{\mathcal{O}}$, or with respect to other
filtrations naturally embedded in our process, see Lemma 5.1.

4.2. Asymptotic results

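We first give the asymptotic behavior of the matrices $S_n^0$, $S_n^1$ and $S_n^{0,1}$. This is
the first step of our asymptotic results.

Proposition 4.2 Suppose that assumptions (HN.1), (HN.2), (HO) and (HI)
are satisfied. Then, for $i \in \{0, 1\}$, we have

$$
\lim_{n \to \infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{S_n^i}{|\mathbb{T}_n^*|} = \mathbf{1}_{\overline{\mathcal{E}}} L^i \quad \text{a.s.} \quad \text{and} \quad
\lim_{n \to \infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{S_n^{0,1}}{|\mathbb{T}_n^*|} = \mathbf{1}_{\overline{\mathcal{E}}} L^{0,1} \quad \text{a.s.}
$$

In addition, $L^0$ and $L^1$, hence $\boldsymbol{\Sigma}$, are positive definite.

A consequence of this proposition is the asymptotic behavior of the increasing
process of the martingale $(M_n)$.

Corollary 4.3 Suppose that assumptions (HN.1), (HN.2), (HO) and (HI)
are satisfied. Then, we have

$$
\lim_{n \to \infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{\boldsymbol{\Sigma}_n}{|\mathbb{T}_n^*|} = \mathbf{1}_{\overline{\mathcal{E}}} \boldsymbol{\Sigma} \quad \text{and} \quad
\lim_{n \to \infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{\Gamma_n}{|\mathbb{T}_n^*|} = \mathbf{1}_{\overline{\mathcal{E}}} \Gamma \quad \text{a.s.}
$$

This result is the keystone of our asymptotic analysis. It enables us to prove
sharp asymptotic properties for the martingale $(M_n)$.

Theorem 4.4 Under assumptions (HN.1), (HN.2), (HO) and (HI), we have

$$
\mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} M_n^t \boldsymbol{\Sigma}_{n-1}^{-1} M_n = O(n) \quad \text{a.s.} \qquad (4.2)
$$

In addition, we also have

$$
\lim_{n \to \infty} \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \frac{1}{n} \sum_{\ell=1}^{n} M_\ell^t \boldsymbol{\Sigma}_{\ell-1}^{-1} M_\ell = 4 \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.} \qquad (4.3)
$$

Moreover, we have the central limit theorem on $(\overline{\mathcal{E}}, \mathbb{P}_{\overline{\mathcal{E}}})$

$$
\frac{1}{\sqrt{|\mathbb{T}_{n-1}^*|}} M_n \xrightarrow{\mathcal{L}} \mathcal{N}(0, \Gamma) \quad \text{on } (\overline{\mathcal{E}}, \mathbb{P}_{\overline{\mathcal{E}}}).
$$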



As seen in Eq. (4.1), $(\hat{\theta}_n - \theta)$ is closely linked to $M_n$ and this last theorem is
then the major step to establish the asymptotic properties of our estimators.
The proof of this Theorem is given in Section 7. As explained before, it is a
consequence of Proposition 4.2, whose proof is detailed in Section 6. In between,
Section 5 presents preliminary results in the form of laws of large numbers for
the observation, noise and BAR processes.



5. Laws of large numbers 

We now state some laws of large numbers involving the observation, noise and
BAR processes. They are based on martingale convergence results, and we start
by giving a general result of convergence for martingales adapted to our
framework.

5.1. Martingale convergence results

The following result is nothing but the strong law of large numbers for square 
integrable martingales, written in our peculiar setting, and will be repeatedly 
used. 

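Lemma 5.1 Let $\mathcal{G} = (\mathcal{G}_n)$ be some filtration, $(H_n)$ and $(G_n)$ be two sequences
of random variables satisfying the following hypotheses:

(i) for all $n \geq 1$, for all $k \in \mathbb{G}_n$, $H_k$ is $\mathcal{G}_{n-1}$-measurable, $G_k$ is $\mathcal{G}_n$-measurable,
and $\mathbb{E}[(H_k G_k)^2] < +\infty$,

(ii) there exist $c^2 > 0$, $r \in [-1, 1]$, such that for all $n \geq 1$, for all $k, p \in \mathbb{G}_n$,
$\mathbb{E}[G_k | \mathcal{G}_{n-1}] = 0$ and

$$
\mathbb{E}[G_k G_p | \mathcal{G}_{n-1}] =
\begin{cases}
c^2 & \text{if } k = p, \\
r c^2 & \text{if } k \neq p \text{ and } [k/2] = [p/2], \\
0 & \text{otherwise,}
\end{cases}
$$

(iii) there exists a sequence of real numbers $(a_n)$ that tends to $\infty$ such that
$\sum_{k \in \mathbb{T}_n} H_k^2 = O(a_n)$.

Then $\sum_{k \in \mathbb{T}_n} H_k G_k$ is a $\mathcal{G}$-martingale and one has

$$
\lim_{n \to \infty} \frac{1}{a_n} \sum_{k \in \mathbb{T}_n} H_k G_k = 0 \quad \text{a.s.}
$$

Proof: Define $D_n = \sum_{k \in \mathbb{T}_n} H_k G_k$. Assumptions (i) and (ii) clearly yield that
$(D_n)$ is a square integrable martingale with respect to the filtration $(\mathcal{G}_n)$. Thanks
to (ii), its increasing process satisfies

$$
\langle D \rangle_n = c^2 \Big( \sum_{k \in \mathbb{T}_n} H_k^2 + 2r \sum_{k \in \mathbb{T}_{n-1}} H_{2k} H_{2k+1} \Big)
\leq c^2 (|r| + 1) \sum_{k \in \mathbb{T}_n} H_k^2,
$$

and now, (iii) implies that $\langle D \rangle_n = O(a_n)$. Finally, since the sequence $(a_n)$
tends to $\infty$, Theorem 1.3.15 of [6] ensures that $D_n = o(a_n)$ a.s. □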

We also recall Lemma A.S of [3] that will be useful in the sequel. 
Lemma 5.2 Let $(A_n)$ be a sequence of real-valued matrices such that

$$
\sum_{n=0}^{\infty} \|A_n\| < \infty \quad \text{and} \quad \lim_{n \to \infty} \sum_{k=0}^{n} A_k = A.
$$

In addition, let $(X_n)$ be a sequence of real-valued vectors which converges to a
limiting value $X$. Then,

$$
\lim_{n \to \infty} \sum_{\ell=0}^{n} A_{n-\ell} X_\ell = AX.
$$
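A scalar illustration of this convolution-type limit may help fix ideas. The sequences below are arbitrary choices made for the example, not objects from the paper: with $A_n = 2^{-n}$ one has $A = 2$, and with $X_n = 1 + 1/(n+1)$ one has $X = 1$, so the sums should approach $AX = 2$.

```python
def convolution_sum(A, X, n):
    """Compute sum_{l=0}^{n} A(n - l) * X(l) for scalar sequences A and X."""
    return sum(A(n - l) * X(l) for l in range(n + 1))


def A(m):
    # Summable sequence with sum_{m >= 0} A(m) = 2.
    return 0.5 ** m


def X(l):
    # Sequence converging to 1.
    return 1.0 + 1.0 / (l + 1)


value = convolution_sum(A, X, 200)
```

The weights $A_{n-\ell}$ concentrate on the most recent terms $X_\ell$, which is why only the limit of $(X_n)$ matters.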
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5.2. Laws of large numbers for the observation process 

We now give more specific results on the asymptotic behavior of the observation 
process {Sk)k>i- Recall the notation TJ^ defined in (2.2). 

Lemma 5.3 Under the assumption (HO), we have the following convergences, 
for {i,j) in {0, l}^ 



lim 62k+j = Pij ^ Wz' 

n-i-+oo tt" ■' TT — 1 

l-i.+oo tt" ^ — TT — 1 



J 



Proof: Recall that 52k+j — ^kCi: so that 



Z] ) + D „ 



since Go = {1}, so that T\ contains 1 or not, according to i = 1 or not, and where 
= X^feeT* ^k{!^k ^ Pij)- To deal with D„, we use Lemma 5.1, with Q = {Z^) 

(recall that Z„ = criCki^ ^ ^n}), Hk = 41{feeT'}, and Gk = {(1 - Pij)l{feeT'}- 
Assumption (i) of Lemma 5.1 is obviously satisfied, since 6k, for k € G„, is 
^„_i-measurable. Regarding (ii), since the sequence (C^) is a sequence of i.i.d. 
random variables with expectation pij and variance afj , we have ]E[Gfc |-E„_i] 
and E[G||Z„_i] = af^, for k e G„, and E[GfeGp|Z„_i] = 0, for fc 7^ p e G„. 
Finally, we turn to assumption (iii): 

n 

fceT„ keT',, i=i 

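thanks to (HO) and Eq. (2.7). Finally, $D_n = o(\pi^n)$, and again using Eq. (2.7),
we obtain the first limit. The proof of the second one is similar using the $\mathcal{Z}$-martingale

$$\sum_{k \in \mathbb{T}_n^i} \delta_k \big(\delta_{2k}\delta_{2k+1} - p^{(i)}(1,1)\big) = \sum_{k \in \mathbb{T}_n} \mathbf{1}_{\{k \in \mathbb{T}^i\}} \delta_k \mathbf{1}_{\{k \in \mathbb{T}^i\}} \big(\zeta_k^0 \zeta_k^1 - p^{(i)}(1,1)\big),$$

and Lemma 5.1 again. □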
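
The observation process of Lemma 5.3 can be simulated with a short sketch. It assumes, as the decomposition $\delta_{2k+j} = \delta_k \zeta_k^j$ suggests, that the pair $(\zeta_k^0, \zeta_k^1)$ is drawn from a law depending only on the type $k \bmod 2$ of the mother. The table `P_LAW` and the function names are hypothetical and only serve the illustration.

```python
import random

# Hypothetical offspring laws: P_LAW[i][(j0, j1)] is the probability that a
# type-i mother has (j0, j1) observed daughters of types (0, 1).
P_LAW = {
    0: {(1, 1): 0.7, (1, 0): 0.1, (0, 1): 0.1, (0, 0): 0.1},
    1: {(1, 1): 0.6, (1, 0): 0.2, (0, 1): 0.1, (0, 0): 0.1},
}

def simulate_delta(p_law, depth, rng):
    """Simulate delta_k for k = 1 .. 2^(depth+1) - 1 (index 0 is unused)."""
    delta = [0] * 2 ** (depth + 1)
    delta[1] = 1  # the root is observed
    for k in range(1, 2 ** depth):
        if delta[k] == 0:
            continue  # delta_{2k+j} = delta_k * zeta_k^j = 0
        outcomes = list(p_law[k % 2])
        weights = [p_law[k % 2][o] for o in outcomes]
        zeta0, zeta1 = rng.choices(outcomes, weights=weights)[0]
        delta[2 * k], delta[2 * k + 1] = zeta0, zeta1
    return delta

def type_counts(delta, n):
    """Return (Z_n^0, Z_n^1): observed cells of each type in generation n."""
    cells = range(2 ** n, 2 ** (n + 1))
    return tuple(sum(delta[k] for k in cells if k % 2 == i) for i in (0, 1))

delta = simulate_delta(P_LAW, depth=12, rng=random.Random(0))
for n in range(1, 13):
    print(n, type_counts(delta, n))
```

On a surviving trajectory the printed counts grow roughly geometrically, which is the behavior that Eq. (2.7) quantifies; a single run may also go extinct.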
5.3. Laws of large numbers for the noise process 

We need to establish strong laws of large numbers for the noise sequence $(\varepsilon_n)$
restricted to the observed indices.

Lemma 5.4 Under assumptions (HN.1), (HN.2), (HO), (HI) and for $i \in \{0,1\}$, one has

$$\lim_{n\to+\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_{n-1}} \delta_{2k+i} \varepsilon_{2k+i} = 0 \quad \text{a.s.}$$

Proof: Set 

We use Lemma 5.1, with Q = {¥^_^_^). Assumption (i) is obvious. For k e GJj+i, 
we have E[Gfc|F^+i] = and E[Gl\¥^^,] = a^, and E[GfeGp|F^+i] = 0, for 
k ^ p Cz Gil^^i - Finally, we turn to assumption (iii): 

ri+l 

fceT„ feGT„ e=i 

as n tends to infinity, thanks to Eq. (2.7). We obtain the result. □ 

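Lemma 5.5 Under assumptions (HN.1), (HN.2), (HO), (HI) and for $i \in \{0,1\}$, one has

$$\lim_{n\to+\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} \varepsilon_k^2 \delta_k = \sigma^2 \frac{\pi}{\pi-1} W z^i \quad \text{a.s.}$$

$$\lim_{n\to+\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} \delta_{2k}\delta_{2k+1}\varepsilon_{2k}\varepsilon_{2k+1} = \rho\, p^{(i)}(1,1) \frac{\pi}{\pi-1} W z^i \quad \text{a.s.}$$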

Proof: In order to prove the first convergence, we apply again Lemma 5.1 to 
the F'-'-martingale: 

Qn= E i^k-'^^)^k= E l{fceT'}4 l{fcgT'}(gfc - O"^) , 

^k ^k 

Under (HN.l), (HN.2), we have E[Gfc|F,^] = and E[G^|F^] = - cr^ and 
E[GfeGp|F^] = 0, for A: 7^ p e G„. Thanks to Eq. (2.7), we have: 

1 1 " 

^E^'= = — E^^ >—,Wz^ a.s. 

which implies both assumption (iii) and the first convergence. To prove the
second convergence, we write



— E ^2k62k+l£2k£2k+l 

— E l{fe6T»}'52fc^2fc+l l{feeT'}(£2fee2fc+l - P) H -P E '^2fe'52/c+l 



fcGT^\To 



nk <Jfc 
We use Lemma 5.1 to prove that the first term converges to 0; Lemma 5.3 gives
the limit of the second term. □



Corollary 5.6 Under assumptions (HN.l), (HN.2), (HO), (HI) and for i e 

{0, 1}, one has 

lim £lS2k+] = <y^Pij ^ -, Wz" a.s. 

fceT,\\To 

1™ ^ ^2k52k+i£2k£2k+i = PP(1,1) tW a.s. 

fceT„\To 

Proof: The proof of the first limit is similar to the preceding ones, using the
decomposition $\delta_{2k+j} = \delta_k \zeta_k^j$ and the properties of the sequence $(\zeta_k^j)$. Using
Lemma 5.5, the second one is straightforward. □



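Lemma 5.7 Under assumptions (HN.1), (HN.2), (HO), (HI) and for $i \in \{0,1\}$, we have

$$\lim_{n\to+\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} \delta_k \varepsilon_k^4 = \tau^4 \frac{\pi}{\pi-1} W z^i \quad \text{a.s.}$$

$$\lim_{n\to+\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} \delta_{2k}\delta_{2k+1}\varepsilon_{2k}^2\varepsilon_{2k+1}^2 = \nu^2 p^{(i)}(1,1) \frac{\pi}{\pi-1} W z^i \quad \text{a.s.}$$

Proof: The proof follows essentially the same lines as the proof of Lemma 5.5,
using the square integrable real martingales

$$Q_n = \sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} \delta_k (\varepsilon_k^4 - \tau^4) \quad \text{and} \quad R_n = \sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} \delta_{2k}\delta_{2k+1} (\varepsilon_{2k}^2 \varepsilon_{2k+1}^2 - \nu^2).$$

It is therefore left to the reader. □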



6. Convergence of the increasing process 

We can now turn to the proof of our keystone result, the convergence of the
increasing process of the main martingale $(M_n)$.



6.1. Preliminary results

We first need an upper bound of the normalized sums of the $\delta_{2n+i} X_n^2$ and
$\delta_{2n}\delta_{2n+1} X_n^2$ before being able to deduce their limits.

Lemma 6.1 Under assumptions (HN.1), (HN.2), (HI) and (HO), and for
$i \in \{0,1\}$, we have

$$\sum_{k \in \mathbb{T}_n} \delta_{2k+i} X_k^2 = O(\pi^n) \quad \text{and} \quad \sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} X_k^2 = O(\pi^n) \quad \text{a.s.}$$
Proof: In all the sequel, for all $k \ge 1$, define $\alpha_{2k} = a$, $\beta_{2k} = b$, $\alpha_{2k+1} = c$,
$\beta_{2k+1} = d$ and $\eta_k = \alpha_k + \varepsilon_k$ with the convention that $\eta_1 = 0$. It follows from a
recursive application of relation (2.1) that, for all $k \ge 1$,

$$X_k = \Big( \prod_{i=0}^{r_k - 1} \beta_{[k/2^i]} \Big) X_1 + \sum_{i=0}^{r_k - 1} \Big( \prod_{j=0}^{i-1} \beta_{[k/2^j]} \Big) \eta_{[k/2^i]}, \qquad (6.1)$$

with $r_k$ the generation of individual $k$ and the convention that an empty product equals 1.
Set $\alpha = \max(|a|, |c|)$, $\beta = \max(|b|, |d|)$ and notice that $0 < \beta < 1$. The proof of Lemma A.5 in [3]
yields

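$$\sum_{k \in \mathbb{T}_n \setminus \mathbb{T}_0} \delta_{2k+i} X_k^2 \le \frac{4}{1-\beta} A_n^i + \frac{4\alpha^2}{1-\beta} B_n^i + 2 X_1^2 C_n^i, \qquad (6.2)$$

where, for $i \in \{0,1\}$,

$$A_n^i = \sum_{k \in \mathbb{T}_n \setminus \mathbb{T}_0} \delta_{2k+i} \sum_{\ell=0}^{r_k-1} \beta^\ell \varepsilon_{[k/2^\ell]}^2, \quad B_n^i = \sum_{k \in \mathbb{T}_n \setminus \mathbb{T}_0} \delta_{2k+i} \sum_{\ell=0}^{r_k-1} \beta^\ell, \quad C_n^i = \sum_{k \in \mathbb{T}_n \setminus \mathbb{T}_0} \delta_{2k+i} \beta^{2 r_k}.$$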
The last two terms above are readily evaluated by splitting the sums genera- 
tion-wise. Indeed, the last term can be rewritten as 

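We now use Lemma 5.2 with $A_n = \pi^{-n}$ and $X_n = \beta^{2n} Z_{n+1}^i \pi^{-n}$. On the one
hand, the series of $(\pi^{-n})$ converges to $\pi/(\pi-1)$ as $\pi > 1$ by assumption; on the
other hand, $\beta^{2n}$ tends to 0 as $n$ tends to infinity as $\beta < 1$, and $Z_n^i \pi^{-n}$ converges
a.s. to $W z^i$ according to Eq. (2.7), hence $\beta^{2n} Z_{n+1}^i \pi^{-n}$ tends to 0 as $n$ tends to
infinity. Lemma 5.2 thus yields

$$\lim_{n\to\infty} \sum_{\ell=1}^{n} (\pi^{-1})^{n-\ell} \big(\beta^{2\ell} Z_{\ell+1}^i \pi^{-\ell}\big) = 0 \quad \text{and} \quad C_n^i = o(\pi^n) \quad \text{a.s.}$$

We now turn to the term $B_n^i$: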

£=1 kGGi ^ ^ ^' e=l fceGf ^ ^' 
due to Lemma 2.1. It remains to control the first term $A_n^i$. Note that $\varepsilon_k^2$ appears
in $A_n^i$ as many times as it has descendants up to the $n$-th generation, and its
multiplicative factor for its $p$-th generation descendant $k$ is $\beta^p \delta_{2k+i}$. This leads to

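$$A_n^i = \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_\ell} \varepsilon_k^2 \sum_{p=0}^{n-\ell} \beta^p \sum_{m=0}^{2^p-1} \delta_{2(2^p k+m)+i}.$$

Now, note that $\sum_{m=0}^{2^p-1} \delta_{2(2^p k+m)+i} = \delta_k \sum_{m=0}^{2^p-1} \delta_{2(2^p k+m)+i}$ is the number of
descendants of type $i$ of individual $k$ after $p+1$ generations. We denote it
$Z_{p+1}^i(k)$, and split $A_n^i$ the following way:

$$A_n^i = \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_\ell} \sigma^2 \sum_{p=0}^{n-\ell} \beta^p \delta_k Z_{p+1}^i(k) + \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_\ell} (\varepsilon_k^2 - \sigma^2) \sum_{p=0}^{n-\ell} \beta^p \delta_k Z_{p+1}^i(k). \qquad (6.3)$$

We first deal with the second term of the above sum.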

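$$\sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_\ell} (\varepsilon_k^2 - \sigma^2) \sum_{p=0}^{n-\ell} \beta^p \delta_k Z_{p+1}^i(k) = \sum_{p=0}^{n-1} \beta^p \sum_{\ell=1}^{n-p} \sum_{k \in \mathbb{G}_\ell} (\varepsilon_k^2 - \sigma^2) \delta_k Z_{p+1}^i(k) = \sum_{p=0}^{n-1} \beta^p \sum_{\ell=1}^{n-p} Y_{\ell,p}^i,$$

where $Y_{\ell,p}^i = \sum_{k \in \mathbb{G}_\ell} (\varepsilon_k^2 - \sigma^2) \delta_k Z_{p+1}^i(k)$. Tedious but straightforward computations
lead to the following expression for the second order moment of $Y_{\ell,p}^i$,
relying on assumptions (HI), (HN.1) and (HN.2). We also use the fact that,
for $k \in \mathbb{G}_\ell$, conditionally to $\{\delta_k = 1\}$, $Z_{p+1}^i(k)$ follows the same law as $Z_{p+1}^i$,
and is independent of any $Z_{p+1}^i(k')$, for $k' \neq k \in \mathbb{G}_\ell$.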

niYl,)'] = {r^~a^)E[Z'^ + Zm{Z;+ir] 

+(i.V-a4)E[Z;+i]2E[ E S2kS2k+i 

< (r^ - a^)E[Z^ + Z\\ (e[(Z;+i)^] + nz^if) , 

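since $\sum_{k \in \mathbb{G}_{\ell-1}} \delta_{2k}\delta_{2k+1} \le \sum_{k \in \mathbb{G}_{\ell-1}} (\delta_{2k} + \delta_{2k+1}) = Z_\ell^0 + Z_\ell^1$. Now, using results
on the moments of a two-type Galton-Watson process (see e.g. [12]), we know
that $\mathbb{E}[(Z_{p+1}^i)^2] = O(\pi^{2p})$. Recall Eq. (2.7) to obtain that $\mathbb{E}[(Y_{\ell,p}^i)^2] = O(\pi^\ell \pi^{2p})$,

which immediately entails that $|Y_{\ell,p}^i| = o(\pi^{a\ell} \pi^{\gamma p})$ a.s., for any $a > 1/2$ and
$\gamma > 1$. One thus gets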

ji — 1 n—p 

T^f^'T. np = o{{^ii^^r) = o{nn 

P=o e=i 

since we can choose $\gamma$ close enough to 1 to get $\beta \pi^\gamma < \pi$, as $\beta < 1$. We have thus
proved that the second term in the sum in (6.3) is $O(\pi^n)$; we now turn to the
first one

n n—i 

n n—i n n—i 

1=1 p=0 keGi t=l p=0 

1 ji— p n— 1 

Finally, $A_n^i = O(\pi^n)$, and the first result of the Lemma is proved. The second
result follows immediately from the remark that the second sum in Lemma 6.1
is clearly smaller than the first one. □
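
The expansion (6.1) that drives the proof above can be checked numerically. The sketch below assumes that relation (2.1) reads $X_{2k} = a + bX_k + \varepsilon_{2k}$ and $X_{2k+1} = c + dX_k + \varepsilon_{2k+1}$; it uses independent standard Gaussian noise and a fully observed tree purely for convenience, since the identity is algebraic and does not depend on the noise law. The function names and parameter values are hypothetical.

```python
import random

def simulate_bar(a, b, c, d, x1, depth, rng):
    """Simulate X_k and eps_k for k = 1 .. 2^(depth+1) - 1 (index 0 unused)."""
    size = 2 ** (depth + 1)
    x, eps = [0.0] * size, [0.0] * size
    x[1] = x1
    for k in range(1, 2 ** depth):
        eps[2 * k], eps[2 * k + 1] = rng.gauss(0, 1), rng.gauss(0, 1)
        x[2 * k] = a + b * x[k] + eps[2 * k]          # even daughter
        x[2 * k + 1] = c + d * x[k] + eps[2 * k + 1]  # odd daughter
    return x, eps

def expansion(k, a, b, c, d, x1, eps):
    """Right-hand side of (6.1): walk from cell k up to the root."""
    total, prod = 0.0, 1.0  # prod starts as the empty product
    m = k
    while m > 1:
        alpha_m, beta_m = (a, b) if m % 2 == 0 else (c, d)
        total += prod * (alpha_m + eps[m])  # eta_m = alpha_m + eps_m
        prod *= beta_m
        m //= 2  # move to the mother [m/2]
    return prod * x1 + total

params = (0.5, 0.3, -0.2, 0.6)  # hypothetical (a, b, c, d)
x, eps = simulate_bar(*params, x1=1.0, depth=6, rng=random.Random(0))
gap = max(abs(x[k] - expansion(k, *params, 1.0, eps)) for k in range(1, 2 ** 7))
print("largest gap between recursion and expansion:", gap)
```

The loop in `expansion` runs exactly $r_k$ times, once per ancestor $[k/2^i]$ with $0 \le i \le r_k - 1$, which matches the index range of the sum in (6.1).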



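Lemma 6.2 Under assumptions (HN.1), (HN.2), (HI) and (HO), and for
$i \in \{0,1\}$, we have

$$\sum_{k \in \mathbb{T}_n} \delta_{2k+i} X_k^4 = O(\pi^n) \quad \text{and} \quad \sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} X_k^4 = O(\pi^n) \quad \text{a.s.}$$

Proof: The proof mimics that of Lemma 6.1. Instead of Equation (6.2), we have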

64 ^ 64a4 

(wF "^(wF' 



fe6T„\To 

with, for i in {0, 1} 

Aj, = ^^^+^ E P'^t^V ^" = E E = E '52fe+./3''-'=. 

fe6T„\To 1=0 ^ fceT„\To «=0 feeT„\To 

We can easily prove that $(B_n^i + C_n^i) = O(\pi^n)$. Therefore, we only need a sharper
estimate for $A_n^i$. Via the same lines as in the proof of Lemma 6.1, but dealing
with $\varepsilon_k^4$ instead of $\varepsilon_k^2$, we can show that $A_n^i = O(\pi^n)$ a.s., which immediately
yields the first result. The second one is obtained by remarking that the second
sum is less than the first one. □



6.2. Asymptotic behavior of the sum of observed data 

We now turn to the asymptotic behavior of the sums of the observed data. More
precisely, set $H_n^i = \sum_{k \in \mathbb{T}_n} \delta_{2k+i} X_k$, for $i$ in $\{0,1\}$, and $H_n = (H_n^0, H_n^1)^t$. The
following result gives the asymptotic behavior of $(H_n)$.

Proposition 6.3 Under assumptions (HN.1), (HN.2), (HI) and (HO), we
have the convergence:

$$\lim_{n\to\infty} \frac{H_n}{\pi^n} = \frac{\pi}{\pi-1} W h \quad \text{a.s.},$$

where



Proof: We first prove that the sequence {Hn) satisfies a recursive property 
using Equation (2.1). 

feeT^i fceTO i;eTi\To fceTi\To 

fceT„\To 

with 

B° = Xi(52 + a ^ (52fc + c ^ (52fc + ^ £fe(52fc 

fceT^ fc6Tj^\To fcGT„\To 

+b ^ -'«^fe^2*;(<54fc -Poo) + X! ^fc'^2fe+l('54fc+2 -Pio)- 

fceT„_i feeT„„i 

Similarly, we have 
with 

-^n = ^l^i + X! "^^fe+l + C ^ (52fc+l + ^ £fc^2fc+l 
fceTO fceT,\\To fceT„\To 

+ b ^ ^fc(52fc(l54fc+l - Poi) + ^ Xfe(52fc+l(l54fc+3 - pii). 

Let us denote $B_n = (B_n^0, B_n^1)^t$. The last equations yield in the matrix form:

fe=i 

with 

— n 

One has $\|P_1^n\| \le \pi^{-n} \beta^n \|P^n\|$. It is well known that $\pi^{-n} P^n$ converges to a
fixed matrix (see e.g. [13]) as $P$ is a positive matrix with dominant eigenvalue
$\pi$. Since $\beta < 1$, the sequence $(P_1^n)$ thus converges to 0 as $n$ tends to infinity. In
addition, $\sum \|P_1^n\|$ is bounded, $I_2 - P_1$ is invertible and $\sum_{n \ge 0} P_1^n$ converges
to $(I_2 - P_1)^{-1}$. In order to use Lemma 5.2, we need to compute the limit of
$B_n/\pi^n$. First, we prove that

$$\sum_{k \in \mathbb{T}_n \setminus \mathbb{T}_0} \varepsilon_k \delta_{2k+i} = o(\pi^n), \qquad (6.4)$$

for $i \in \{0,1\}$, thanks to Lemma 5.1. Indeed, set $\mathcal{G} = \mathbb{F}^{\mathcal{O}}$, $H_k = \delta_{2k+i}$, $G_k = \varepsilon_k$.
Thus hypothesis (i) of Lemma 5.1 is obvious, (ii) comes from (HN.1) and
(HN.2). Finally, the last assumption (iii) holds, since

ji+i 

feeT„\To 1=1 

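the last equality coming from (2.7), which holds thanks to (HO). Now, we turn
to the terms

$$\sum_{k \in \mathbb{T}_n} X_k \delta_{2k+i} (\delta_{2(2k+i)+j} - p_{ij}) = \sum_{k \in \mathbb{T}_n} X_k \delta_{2k+i} (\zeta_{2k+i}^j - p_{ij}),$$

for $(i,j) \in \{0,1\}^2$. We use again Lemma 5.1, with the following setting:
$(\mathcal{G}_n) = (\mathcal{Z}_{n+1} \vee \mathcal{F}_{n+1})$, $H_k = X_k \delta_{2k+i}$, $G_k = \zeta_{2k+i}^j - p_{ij}$. For $k \in \mathbb{G}_n$, we check
that $X_k \delta_{2k+i}$ is $\mathcal{G}_{n-1}$-measurable, since $X_k$ is $\mathcal{F}_n$-measurable and $\delta_{2k+i}$ is
$\mathcal{Z}_n$-measurable. Next, because of (HI) and of the independence of the sequence
$(\zeta_k)$, $\mathbb{E}[\zeta_{2k+i}^j - p_{ij} | \mathcal{Z}_n \vee \mathcal{F}_n] = 0$. The same independence hypothesis yields that
$\mathbb{E}[G_k G_p | \mathcal{Z}_n \vee \mathcal{F}_n] \neq 0$ only if $k = p$, and then equals $\sigma_{ij}^2$. Finally,

$$\sum_{k \in \mathbb{T}_n} (X_k \delta_{2k+i})^2 = \sum_{k \in \mathbb{T}_n} X_k^2 \delta_{2k+i} = O(\pi^n),$$

thanks to Lemma 6.1. Now, Lemma 5.1 allows us to conclude that

$$\sum_{k \in \mathbb{T}_n} X_k \delta_{2k+i} (\delta_{2(2k+i)+j} - p_{ij}) = o(\pi^n), \qquad (6.5)$$

for $(i,j) \in \{0,1\}^2$. Next, Lemma 5.3 gives the limit of the term $\sum_{k \in \mathbb{T}_n^i} \delta_{2k+j}$,
for $(i,j) \in \{0,1\}^2$, so that we finally obtain: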

lin, ^ = W— ( + '^P'^ ) ^ W—P' 

and we use Lemma 5.2 to conclude. □ 



Remark 6.4 Putting together Proposition 6.3 and Eq. (6.5) above, we immediately
get that, under the same assumptions as those of Proposition 6.3,

$$\lim_{n\to\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n} X_k \delta_{2k+i} \delta_{2(2k+i)+j} = \frac{\pi}{\pi-1} h^i p_{ij} W \quad \text{a.s.}$$

for all $(i,j) \in \{0,1\}^2$, a result we will use for the study of the limit of $\sum X_k^2 \delta_{2k+i}$.

6.3. Asymptotic behavior of the sum of squared observed data

We now turn to the asymptotic behavior of the sums of the squared observed
data. Set $K_n^i = \sum_{k \in \mathbb{T}_n} \delta_{2k+i} X_k^2$, for $i$ in $\{0,1\}$, and $K_n = (K_n^0, K_n^1)^t$. The
following result gives the asymptotic behavior of $(K_n)$.

Proposition 6.5 Under assumptions (HN.1), (HN.2), (HI) and (HO), we
have the convergence:

$$\lim_{n\to\infty} \frac{K_n}{\pi^n} = \frac{\pi}{\pi-1} W k \quad \text{a.s.},$$

where



and 



k-( p rip* f + + f 



p _^ptf 



Proof: We use again Equation (2.1) to prove a recursive property for the sequence
$(K_n)$. Following the same lines as in the proof of Proposition 6.3, we
obtain:

Kn _ ^ Kn-1 Cn _ fc- i B"^^ 

- + - ^2Ko + 2^P2 -^i 

where C„ = (C°, C;^)* is defined by 

= X^h+i + a'^ hk+i + b'^ ^ Xl62k{S4k+i - Poi) 

+2ab ^ XkS2k64k+i + 2a ^ efc<^2fc+i + 26 ^ X^k-^£kS2k+i 
feeT„_i feeTO kejo 

+ ^ eihk+i + X! '^2fc+i + X! -^fe'52fc+l(<^4/c+2+-t - Pl-i) 
fc6T„\To *:eT,\\To /ceT„_i 

-|-2cd ^ Xfe(52fe+i<54fc+2+i + 2c ^ efc(52fc+i + 2d ^ Xj|jefc(52fc+i, 

for $i \in \{0,1\}$. Note that $\|P_2^n\| \le \pi^{-n} \beta^{2n} \|P^n\|$, so that $P_2^n$ converges to 0. In
addition, $\sum \|P_2^n\|$ is bounded, $I_2 - P_2$ is invertible and $\sum_{n \ge 0} P_2^n$ converges
to $(I_2 - P_2)^{-1}$. In order to use Lemma 5.2, we have to compute the limit of
$C_n/\pi^n$. Following the proof of (6.4), we already have, for $(i,j) \in \{0,1\}^2$,

^ £kS2k+i = o(7r") a.s. 

keTi 

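We now turn to the terms $\sum_{k \in \mathbb{T}_{n-1}} X_k^2 \delta_{2k+i} (\delta_{2(2k+i)+j} - p_{ij})$, for $(i,j) \in \{0,1\}^2$.
To deal with these terms, we use Lemma 5.1 with the same setting we used to
prove Eq. (6.5), except that we replace $X_k$ with $X_k^2$. Assumptions (i) and (ii)
of Lemma 5.1 have thus already been checked, and regarding (iii), we have
$\sum_{k \in \mathbb{T}_{n-1}} X_k^4 \delta_{2k+i} = O(\pi^n)$ a.s. thanks to Lemma 6.2. We conclude that

$$\sum_{k \in \mathbb{T}_{n-1}} X_k^2 \delta_{2k+i} (\delta_{2(2k+i)+j} - p_{ij}) = o(\pi^n) \quad \text{a.s.}$$

Next, we study $\sum_{k \in \mathbb{T}_n^i} X_{[k/2]} \varepsilon_k \delta_{2k+j}$, for $(i,j) \in \{0,1\}^2$. We use the same martingale
tool, namely Lemma 5.1, with $\mathcal{G} = \mathbb{F}^{\mathcal{O}}$, $H_k = X_{[k/2]} \delta_{2k+j} \mathbf{1}_{\{k \in \mathbb{T}^i\}}$ and
$G_k = \varepsilon_k$. Assumptions (i) and (ii) are easily checked, and since

$$\sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} X_{[k/2]}^2 \delta_{2k+j} = \sum_{k \in \mathbb{T}_{n-1}} X_k^2 \delta_{2(2k+i)+j} \le \sum_{k \in \mathbb{T}_{n-1}} X_k^2 \delta_{2k+i} = O(\pi^n),$$

the last equality coming from Lemma 6.1, assumption (iii) is satisfied and

$$\sum_{k \in \mathbb{T}_n^i \setminus \mathbb{T}_0} X_{[k/2]} \varepsilon_k \delta_{2k+j} = o(\pi^n) \quad \text{a.s.}$$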

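Now, Corollary 5.6 yields that for $i \in \{0,1\}$,

$$\lim_{n\to\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n \setminus \mathbb{T}_0} \varepsilon_k^2 \delta_{2k+i} = \sigma^2 (p_{0i} z^0 + p_{1i} z^1) \frac{\pi}{\pi-1} W \quad \text{a.s.}$$

Finally, Remark 6.4 gives the limit of $\pi^{-n} \sum_{k \in \mathbb{T}_{n-1}} X_k \delta_{2k+i} \delta_{2(2k+i)+j}$, and Lemma 5.3
that of $\pi^{-n} \sum_{k \in \mathbb{T}_n} \delta_{2k+i}$, so that we finally obtain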

hm £^^^(Poo P.o\f ial + + |,,,o ^^^^ 



And we conclude using Lemma 5.2 again. □ 

Propositions 6.3 and 6.5 together with Equation (2.7) give the asymptotic
behavior of the matrices $S_n^0$ and $S_n^1$. The next result gives the behavior of matrix
$S_n^{0,1}$ given through the quantities $\sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} X_k$ and $\sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} X_k^2$. It
is an easy consequence of Propositions 6.3 and 6.5, together with Lemma 5.3
for the first limit.

6.4. Asymptotic behavior of covariance terms

Finally, we turn to the asymptotic behavior of the covariance terms, which
are involved in matrix $S_n^{0,1}$. We thus define $H_n^{0,1} = \sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} X_k$ and
$K_n^{0,1} = \sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} X_k^2$.

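Proposition 6.6 Under assumptions (HN.1), (HN.2), (HO) and (HI), we
have the almost sure convergences:

$$\lim_{n\to\infty} \frac{1}{\pi^n} \sum_{k \in \mathbb{T}_n} \delta_{2k}\delta_{2k+1} = \frac{\pi}{\pi-1} W \bar{p}(1,1),$$

$$\lim_{n\to\infty} \frac{H_n^{0,1}}{\pi^n} = \frac{\pi}{\pi-1} W h^{0,1} \quad \text{and} \quad \lim_{n\to\infty} \frac{K_n^{0,1}}{\pi^n} = \frac{\pi}{\pi-1} W k^{0,1},$$

where

$$\bar{p}(1,1) = p^{(0)}(1,1) z^0 + p^{(1)}(1,1) z^1, \qquad (6.6)$$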

\ TT TT / 

\ TT TT y 

Proof: The first limit is a consequence of Lemma 5.3. Next, using Eq. (2.1) 
we obtain H^tt~'^ and K^tt~'^ in terms of 7r~"X]fcgT' ^^k, H^_iTr~" and 
/C^_]^7r^" and the result follows from Propositions 6.3 and 6.5. □ 



Proof of Proposition 4.2: We are now in a position to complete the proof of
Proposition 4.2. Simply notice that we have proved in Propositions 6.3, 6.5 and
6.6 all the desired convergences, except that we normalized the sums with $\pi^n$.
Thanks to Lemma 2.1, we end the proof. □

Remark 6.7 In the case of fully observed data, the matrix $P$ is a $2 \times 2$ matrix
with all entries equal to 1, $\pi$ equals 2 and the normalized eigenvector $z$ equals
$(1/2, 1/2)$. One can check that in that case, our limits correspond to those of [3].
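
The values quoted in Remark 6.7 can be recovered with a small power iteration. This is a sketch under the assumption that $z$ is the left eigenvector of $P$ associated with $\pi$, normalized so that its entries sum to 1 (for the all-ones matrix left and right eigenvectors coincide, so the check does not depend on that choice). The second matrix is a hypothetical missing-data example.

```python
def dominant_left_eigen(p, iterations=500):
    """Power iteration for z P = pi z, with z normalised so z[0] + z[1] = 1."""
    z = [0.3, 0.7]  # arbitrary positive starting vector summing to 1
    growth = 0.0
    for _ in range(iterations):
        w = [z[0] * p[0][0] + z[1] * p[1][0],
             z[0] * p[0][1] + z[1] * p[1][1]]  # w = z P
        growth = w[0] + w[1]  # tends to pi because z sums to 1
        z = [w[0] / growth, w[1] / growth]
    return growth, z

# Fully observed case of Remark 6.7: all entries of P equal 1.
pi_full, z_full = dominant_left_eigen([[1.0, 1.0], [1.0, 1.0]])
print(pi_full, z_full)

# Hypothetical missing-data case: entries below 1, still supercritical.
pi_miss, z_miss = dominant_left_eigen([[0.8, 0.8], [0.8, 0.7]])
print(pi_miss, z_miss)
```

Power iteration is used only because $P$ is a positive matrix, so its dominant eigenvalue is simple and the iteration converges from any positive starting vector.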



7. Asymptotic behavior of the main martingale 

Theorem 4.4 is a strong law of large numbers for the martingale $(M_n)$. The
standard strong law for martingales is unhelpful here. Indeed, it is valid for
martingales that can be decomposed in a sum of the form $\sum_{\ell=1}^{n} \Psi_{\ell-1} \xi_\ell$ where
$(\Psi_\ell)$ is predictable and $(\xi_\ell)$ is a martingale difference sequence. In addition, $(\Psi_\ell)$
and $(\xi_\ell)$ are required to be sequences of fixed-size vectors. Such a decomposition
with fixed-size vectors is impossible in our context (see Lemma A.2), essentially
because the number of observed data in each generation asymptotically grows
exponentially fast as $\pi^n$. Consequently, we are led to propose a new strong law
of large numbers for $(M_n)$, adapted to our framework.

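For all $n \ge 1$, let $V_n = M_n^t \Sigma_{n-1}^{-1} M_n$ where $\Sigma_n$ is defined in Section 3.1.
First of all, we have

$$\begin{aligned}
V_{n+1} &= M_{n+1}^t \Sigma_n^{-1} M_{n+1} = (M_n + \Delta M_{n+1})^t \Sigma_n^{-1} (M_n + \Delta M_{n+1}), \\
&= V_n - M_n^t (\Sigma_{n-1}^{-1} - \Sigma_n^{-1}) M_n + 2 M_n^t \Sigma_n^{-1} \Delta M_{n+1} + \Delta M_{n+1}^t \Sigma_n^{-1} \Delta M_{n+1}.
\end{aligned}$$

Note that $M_n^t \Sigma_n^{-1} \Delta M_{n+1}$ and $\Delta M_{n+1}^t \Sigma_n^{-1} M_n$ are scalars, hence they are
equal to their own transpose and as a result, one has $M_n^t \Sigma_n^{-1} \Delta M_{n+1} = \Delta M_{n+1}^t \Sigma_n^{-1} M_n$.
By summing over the identity above, we obtain the main
decomposition

$$V_{n+1} + \mathcal{A}_n = V_1 + \mathcal{B}_{n+1} + \mathcal{W}_{n+1}, \qquad (7.1)$$

where

$$\mathcal{A}_n = \sum_{\ell=1}^{n} M_\ell^t (\Sigma_{\ell-1}^{-1} - \Sigma_\ell^{-1}) M_\ell,$$

$$\mathcal{B}_{n+1} = 2 \sum_{\ell=1}^{n} M_\ell^t \Sigma_\ell^{-1} \Delta M_{\ell+1}, \qquad \mathcal{W}_{n+1} = \sum_{\ell=1}^{n} \Delta M_{\ell+1}^t \Sigma_\ell^{-1} \Delta M_{\ell+1}.$$

The asymptotic behavior of the left-hand side of (7.1) is as follows.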

Proposition 7.1 Under assumptions (HN.1), (HN.2), (HO) and (HI), we
have

$$\lim_{n\to+\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{V_{n+1} + \mathcal{A}_n}{n} = \frac{4(\pi-1)}{\pi} \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

Proof: Thanks to the laws of large numbers derived in Sections 5 and 6, the
proof of Proposition 7.1 follows essentially the same lines as [3] and is given in
Appendix A. □

Since $(V_n)$ and $(\mathcal{A}_n)$ are two sequences of nonnegative real numbers, Proposition 7.1
yields that $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} V_n = O(n)$ a.s., which proves Equation (4.2). We
now turn to the proof of Equation (4.3). We start with a sharp rate of convergence
for $(M_n)$.

Proposition 7.2 Under assumptions (HN.1), (HN.2), (HO) and (HI),
we have, for all $\eta > 1/2$,

$$\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \|M_n\|^2 = o(|\mathbb{T}_{n-1}^*| n^\eta) \quad \text{a.s.}$$

Proof: The result is obvious on $\mathcal{E}$. On $\overline{\mathcal{E}}$, the proof follows again the same lines
as [3] thanks to the laws of large numbers derived in Sections 5 and 6. It is given
in Appendix B. □

A direct application of Proposition 7.2 ensures that $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} V_n = o(n^\eta)$ a.s.
for all $\eta > 1/2$. Hence, Proposition 7.1 immediately leads to the following result.

Corollary 7.3 Under assumptions (HN.1), (HN.2), (HO) and (HI), we have

$$\lim_{n\to+\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{\mathcal{A}_n}{n} = \frac{4(\pi-1)}{\pi} \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

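Proof of Result (4.3) of Theorem 4.4: First of all, $\mathcal{A}_n$ may be rewritten as

$$\mathcal{A}_n = \sum_{\ell=1}^{n} M_\ell^t (\Sigma_{\ell-1}^{-1} - \Sigma_\ell^{-1}) M_\ell = \sum_{\ell=1}^{n} M_\ell^t \Sigma_{\ell-1}^{-1/2} \Delta_\ell \Sigma_{\ell-1}^{-1/2} M_\ell,$$

where $\Delta_n = I_4 - \Sigma_{n-1}^{1/2} \Sigma_n^{-1} \Sigma_{n-1}^{1/2}$. Thanks to Corollary 4.3, we know that

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \Delta_n = \frac{\pi-1}{\pi} I_4 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

Besides, Corollary 7.3 yields that $\mathcal{A}_n \sim n \frac{4(\pi-1)}{\pi} \sigma^2$ a.s. on $\overline{\mathcal{E}}$. Plugging these two
results into the equality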
results into the equality 

gives that -^^^7-1-^^ ^ -^n^,^ ^-S- on E and convergence (4.3) directly 

follows. □ 

8. Proof of the main results 

We can now proceed to proving our main results. 

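8.1. Strong consistency for $\hat{\theta}_n$

Theorem 3.2 is a direct consequence of Theorem 4.4.

Proof of result (3.3) of Theorem 3.2: Recall that $V_n = M_n^t \Sigma_{n-1}^{-1} M_n$. It
clearly follows from Equation (4.1) that

$$V_n = (\hat{\theta}_n - \theta)^t \Sigma_{n-1} (\hat{\theta}_n - \theta).$$

Consequently, the asymptotic behavior of $\hat{\theta}_n - \theta$ is clearly related to the one
of $V_n$. More precisely, we can deduce from Corollary 4.3 and the fact that the
eigenvalues of a matrix are continuous functions of its coefficients the following
result

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{\lambda_{\min}(\Sigma_n)}{|\mathbb{T}_n^*|} = \lambda_{\min}(\Sigma) \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.},$$

where $\lambda_{\min}(A)$ denotes the smallest eigenvalue of matrix $A$. Since $L$ as well as
$\Sigma$ is positive definite, one has $\lambda_{\min}(\Sigma) > 0$. Therefore, as

$$\|\hat{\theta}_n - \theta\|^2 \le \frac{V_n}{\lambda_{\min}(\Sigma_{n-1})},$$

we use Result (4.2) of Theorem 4.4 to conclude that

$$\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \|\hat{\theta}_n - \theta\|^2 = O\Big(\frac{n}{|\mathbb{T}_{n-1}^*|}\Big) \mathbf{1}_{\overline{\mathcal{E}}} = O\Big(\frac{\log |\mathbb{T}_{n-1}^*|}{|\mathbb{T}_{n-1}^*|}\Big) \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.},$$

which completes the proof of result (3.3). □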

We now prove the quadratic strong law (QSL).

Proof of result (3.4) of Theorem 3.2: The QSL is a direct consequence of
result (4.3) of Theorem 4.4 together with the fact that $\hat{\theta}_n - \theta = \Sigma_{n-1}^{-1} M_n$.
Indeed, we have

1 " 

1=1 

1 " _ 

= l{|G;|>O}-^(e£-e)*S,_i(0,~0) 

1=1 

1 " - Til I 

1 " 

1=1 

which completes the proof □ 



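8.2. Strong consistency for the variance estimators

For all $k \ge 1$, set

$$\hat{V}_k = (\delta_{2k} \hat{\varepsilon}_{2k}, \delta_{2k+1} \hat{\varepsilon}_{2k+1})^t, \qquad V_k = (\delta_{2k} \varepsilon_{2k}, \delta_{2k+1} \varepsilon_{2k+1})^t.$$

The almost sure convergence of $\hat{\sigma}_n^2$ and $\hat{\rho}_n$ is strongly related to that of $\hat{V}_k - V_k$.

Proof of result (3.5) of Theorem 3.3: Equation (3.5) can be rewritten as

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \sum_{k \in \mathbb{T}_{n-1}} \|\hat{V}_k - V_k\|^2 = 4(\pi-1) \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

Once again, we are searching for a link between the sum of $\|\hat{V}_k - V_k\|^2$ and the
processes $(\mathcal{A}_n)$ and $(V_n)$ whose convergence properties were previously investigated.
For $i \in \{0,1\}$ and $n \ge 0$, let

$$\Phi_n^i = \begin{pmatrix}
\delta_{2(2^n)+i} & \delta_{2(2^n+1)+i} & \cdots & \delta_{2(2^{n+1}-1)+i} \\
\delta_{2(2^n)+i} X_{2^n} & \delta_{2(2^n+1)+i} X_{2^n+1} & \cdots & \delta_{2(2^{n+1}-1)+i} X_{2^{n+1}-1}
\end{pmatrix}$$

be the collection of $(\delta_{2k+i}, \delta_{2k+i} X_k)^t$, $k \in \mathbb{G}_n$, and set

$$\Psi_n = \begin{pmatrix} \Phi_n^0 & 0 \\ 0 & \Phi_n^1 \end{pmatrix}.$$

Note that $\Psi_n$ is a $4 \times 2^{n+1}$ matrix. For all $n \ge 1$, we thus have, in the matrix
form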

feeG„ fceG„ 

= (9n~ey^nKidn-0), 

where 

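Now, we can deduce from Corollary 4.3 that

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \Lambda_n = (\pi-1) I_4 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.},$$

which implies that

$$\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \sum_{k \in \mathbb{G}_n} \|\hat{V}_k - V_k\|^2 = M_n^t \Sigma_{n-1}^{-1} M_n (\pi - 1 + o(1)) \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \quad \text{a.s.}$$

Therefore, we can conclude via convergence (4.3) that

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \sum_{k \in \mathbb{T}_{n-1}} \|\hat{V}_k - V_k\|^2 = \lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{\pi-1}{n} \sum_{\ell=1}^{n} M_\ell^t \Sigma_{\ell-1}^{-1} M_\ell = 4(\pi-1) \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.},$$

which completes the proof. □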
Proof of result (3.6) of Theorem 3.3: First of all, one has 



fell -Il»'fel 

feeT„_i 



\n {Wk-Vkf + 2{Vk-VkfVk) 



IT 

' '"' feeT„_i 
Set

$$P_n = \sum_{k \in \mathbb{T}_{n-1}} (\hat{V}_k - V_k)^t V_k = \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_{\ell-1}} (\hat{V}_k - V_k)^t V_k.$$

We clearly have

$$\Delta P_{n+1} = P_{n+1} - P_n = \sum_{k \in \mathbb{G}_n} (\hat{V}_k - V_k)^t V_k.$$

One can observe that for all $k \in \mathbb{G}_n$, $\hat{V}_k - V_k$ is $\mathcal{F}_n^{\mathcal{O}}$-measurable. Consequently,
$(P_n)$ is a real martingale transform for the filtration $\mathbb{F}^{\mathcal{O}}$. Hence, we can deduce
from the strong law of large numbers for martingale transforms given in Theorem
1.3.24 of [6] together with (3.5) that

$$\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} P_n = o\Big( \sum_{k \in \mathbb{T}_{n-1}} \|\hat{V}_k - V_k\|^2 \Big) = o(n) \quad \text{a.s.}$$

It ensures once again via convergence (3.5) that

IT* I 1 
hm l{|G.|>o}^(a^-a2) = lim l{|G.|>o}- ^ H^'^-^'^ll' 

fceT„_i 

= 4(7r-l)(T^l^ a.s. 
which completes the proof of result (3.6). □ 

Proof of results (3.7) and (3.8) of Theorem 3.3: We now turn to the study
of the covariance estimator $\hat{\rho}_n$. We have

= 1^*01 I X/ ^2k{£2k - £2k)S2k+l{£2k+l - £2k+l) + |™,^oi ,Qn, 



where 



Qn = ^ '52fc(52fc+l(e2fc — £2k)£2k+l + <^2/£^2fe+l (e2fc+l ^ £2k+l)£2k 

keTn-i 

= J2 iVk-VkY32Vk, 



with 



1 

1 



J2 = 

The process $(Q_n)$ is a real martingale transform for the filtration $\mathbb{F}^{\mathcal{O}}$ satisfying

$$Q_n = o\Big( \sum_{k \in \mathbb{T}_{n-1}} \|\hat{V}_k - V_k\|^2 \Big) = o(n) \quad \text{a.s.}$$

It now remains to prove that

lim 1{|G'|>0}— S2kS2k+l{£2k — £2k){£2k+l — £2k+l) 

R 



n— >oo " TL 

keT„ 



lim ^ = p(7r- l)ir((i:^)-i(L°'i)2(i")-i)l^ a.s. (8.1) 



n— J-oo Tl 
where

n 



where $\otimes$ denotes the Kronecker product of matrices, i.e.

) - ^fi^fy 

and ^^""^ is defined similarly as and <b\ by the collection of (i52fe'^2fc+i) 
52k52k+iXky, k e Gf. As - - proposition 4.2 implies 

that 



n— ^00 



so that the asymptotic behavior of Rn/n boils down to that of 

n 

M*s7if (J2 M,. 

A proof along the same lines as in Section 7 finally yields the expected results, 
i.e. 

which completes the proof of convergence (8.1). We then obtain 

which completes the proof of Theorem 3.3. □ 
8.3. Asymptotic normality 

Contrary to the previous literature on BAR processes, we cannot use the central
limit theorem given by Propositions 7.8 and 7.9 of [11] as in [8, 3] because the
normalizing term is now the number of observations and is therefore random.
The approach used in [5] strongly relies on the Gaussian assumption for the noise
sequence that does not hold here. Instead, we use the central limit theorem for
martingales given in Theorem 3.II.10 of Duflo [6]. However, unlike the previous
sections, this theorem cannot be directly applied to the martingale $(M_n)$ because
the number of observed data in a given generation grows exponentially
fast and the Lindeberg condition does not hold. The solution is to use a new
filtration. Namely, instead of using the observed generation-wise filtration, we
will use the sister pair-wise one. Let

$$\mathcal{G}_p^{\mathcal{O}} = \mathcal{O} \vee \sigma\{\delta_1 X_1, (\delta_{2k} X_{2k}, \delta_{2k+1} X_{2k+1}), 1 \le k \le p\}$$

be the $\sigma$-algebra generated by the whole history $\mathcal{O}$ of the Galton-Watson process
and all observed individuals up to the offspring of individual $p$. Hence
$(\delta_{2k} \varepsilon_{2k}, \delta_{2k+1} \varepsilon_{2k+1})$ is $\mathcal{G}_k^{\mathcal{O}}$-measurable. In addition, assumptions (HN.1) and
(HI) imply that the processes $(\delta_{2k} \varepsilon_{2k}, X_k \delta_{2k} \varepsilon_{2k}, \delta_{2k+1} \varepsilon_{2k+1}, X_k \delta_{2k+1} \varepsilon_{2k+1})^t$,
$(\delta_{2k} \varepsilon_{2k}^2 + \delta_{2k+1} \varepsilon_{2k+1}^2 - (\delta_{2k} + \delta_{2k+1}) \sigma^2)$ and $(\delta_{2k}\delta_{2k+1}(\varepsilon_{2k}\varepsilon_{2k+1} - \rho))$ are
$\mathcal{G}^{\mathcal{O}}$-martingale difference sequences. In all the sequel, we will work under the
probability $\mathbb{P}_{\overline{\mathcal{E}}}$ and we denote by $\mathbb{E}_{\overline{\mathcal{E}}}$ the corresponding expectation.



Proof of Theorem 3.4, first step: We apply Theorem 3.II.10 of [6] to the
$\mathcal{G}^{\mathcal{O}}$-martingale $M^{(n)} = (M_p^{(n)})_{p \ge 1}$ defined by



Set Vn 



■LVJ. p 



IT 



with 



D 



k — 



«l k=l 



^2k£2k 
^kS2k£2k 
<52fe+l£2fe+l 
\ XkS2k+l£2k+l J 



2"+i _ 1. Note that if fc ^ T* , then Dk^O which implies that 



IT 



|T„| 



nl k=l 



IT 



E^- 



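As the non-extinction set $\overline{\mathcal{E}}$ is in $\mathcal{G}_k^{\mathcal{O}}$ for every $k \ge 1$, it is easy to prove that

$$\mathbb{E}_{\overline{\mathcal{E}}}[D_k D_k^t | \mathcal{G}_{k-1}^{\mathcal{O}}] = \begin{pmatrix}
\sigma^2 \delta_{2k} & \sigma^2 \delta_{2k} X_k & \rho \delta_{2k}\delta_{2k+1} & \rho \delta_{2k}\delta_{2k+1} X_k \\
\sigma^2 \delta_{2k} X_k & \sigma^2 \delta_{2k} X_k^2 & \rho \delta_{2k}\delta_{2k+1} X_k & \rho \delta_{2k}\delta_{2k+1} X_k^2 \\
\rho \delta_{2k}\delta_{2k+1} & \rho \delta_{2k}\delta_{2k+1} X_k & \sigma^2 \delta_{2k+1} & \sigma^2 \delta_{2k+1} X_k \\
\rho \delta_{2k}\delta_{2k+1} X_k & \rho \delta_{2k}\delta_{2k+1} X_k^2 & \sigma^2 \delta_{2k+1} X_k & \sigma^2 \delta_{2k+1} X_k^2
\end{pmatrix}$$

and Corollary 4.3 gives the $\mathbb{P}_{\overline{\mathcal{E}}}$-almost sure limit of the increasing process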



< 



it; 



^ E^[DkDl\g'^^,] = 



it: 



^ r. 



•2) 



Therefore, the first assumption of Theorem 3.II.10 of [6] holds under $\mathbb{P}_{\overline{\mathcal{E}}}$. We now
want to prove the Lindeberg condition, that is, the convergence in probability to
0 of the following expression $L_n$ for all $\epsilon > 0$:



< 



< 



1 



E%[ 

fceT* 



suPfe>oE[||I?fe 



T* 



\g^_,] ^ EjiWDkf \gt,] 



\T1 



E 



it; 



\5t 



i) 
for some $r > 2$ and thanks to Hölder and Chebyshev inequalities. Besides, using
Eq. (6.1) and similar calculations as in Lemma 6.1, one readily obtains

X8 < 2^(1 - E^/Jl^f^]! + 2'f3'^-X!. 

fe=0 

Now, assumption (HN.1) together with $\beta < 1$ yields the existence of a constant
$C$ such that

$$\sup_{k \ge 0} \mathbb{E}[X_k^8] \le C (1 + \mathbb{E}[X_1^8]),$$

and recall that $\mathbb{E}[X_1^8] < \infty$. Finally, since the entries of $D_k$ are combinations of
$\varepsilon_{2k}$, $\varepsilon_{2k+1}$ and $X_k$, using again (HN.1) and (HI), one obtains that

$$\sup_{k \ge 0} \mathbb{E}[\|D_k\|^r | \mathcal{G}_{k-1}^{\mathcal{O}}] < \infty \quad \text{a.s.}$$

with $r = 8$. The Lindeberg condition is thus proved, plugging the convergence
(8.2) into the following equality:

I nl f.^j^ I n\ 

We can now conclude that under 

, ^ V Di. = — ^=M„ A A/'(o,r). 

Finally, result (3.9) follows from Eq. (4.1) and Corollary 4.3 together with Slut- 
sky's Lemma. □ 

Proof of Theorem 3.4, second step: On the one hand, we apply Theo- 
rem 3.II.10 of [6] to the g^-martingale M^") = (il/4"^){p>i} defined by 

1 ^ 

A^^"^ = ^p= E "fc and Vk = 62kelk + S2k+iel+i - {S2k + S2k+i)<j'^ ■ 

Vl^nl k=l 

As above, one clearly has 

Using assumptions (HN.l), (HI) and Lemma 5.3 we compute the limit of the 
increasing process under ¥j 

lim < M(") >,„= {t" - a") + Mill) - a") Fj a.s. 
Therefore, the first assumption of Theorem 3.II.10 of [6] holds under $\mathbb{P}_{\overline{\mathcal{E}}}$. Thanks
to assumptions (HN.2) and (HI) we can prove that for some $r > 2$,

$$\sup_{k \ge 0} \mathbb{E}_{\overline{\mathcal{E}}}[\|v_k\|^r | \mathcal{G}_{k-1}^{\mathcal{O}}] < \infty \quad \text{a.s.},$$

which implies the Lindeberg condition. Therefore, we obtain that under $\mathbb{P}_{\overline{\mathcal{E}}}$

2\ -C^ ,r,,. ,4 4s , 2p(l,l) 



TT 

Furthermore, we infer from Eq. (3.6) that 

hm Vfnii^n - <yl) = Pj a.s. 

which yields result (3.10). 

We turn now to the proof of result (3.11) with another $\mathcal{G}^{\mathcal{O}}$-martingale $(M^{(n)})$
defined by



1 ^ 

M(") = , V (52fe(52/c + l(£2fe£2fc+l - P)- 

As above, one easily shows that 



Mi^)^ J 52^52^+l{e2^e2^+l-p)^J\^\{Pn-p)■ 



lim < Af(") - -^'^ .2 



Using assumptions (HN.1) and (HI), we compute the limit of the increasing
process

V T ~ p 



We also derive the Lindeberg condition. Consequently, we obtain that under $\mathbb{P}_{\overline{\mathcal{E}}}$,
one has

Furthermore, we infer from (3.8) that 



lim a/|T;0_\|(p„ - p„) = P^ a.s. 

Finally, result (3.11) follows, which completes the proof of Theorem 3.4. □ 

Appendix A: Quadratic strong law 

In order to establish the quadratic strong law for the main martingale $(M_n)$, we
are going to study separately the asymptotic behavior of $(\mathcal{W}_n)$ and $(\mathcal{B}_n)$ which
appear in the main decomposition given by Equation (7.1).
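
Lemma A.1 Under assumptions (HN.1), (HN.2), (HO) and (HI), we have

$$\lim_{n\to+\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \mathcal{W}_n = \frac{4(\pi-1)}{\pi} \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

Proof: First of all, we have the decomposition $\mathcal{W}_{n+1} = \mathcal{T}_{n+1} + \mathcal{R}_{n+1}$ where

$$\mathcal{T}_{n+1} = \sum_{\ell=1}^{n} \frac{\Delta M_{\ell+1}^t \Sigma^{-1} \Delta M_{\ell+1}}{|\mathbb{T}_\ell^*|}, \qquad \mathcal{R}_{n+1} = \sum_{\ell=1}^{n} \frac{\Delta M_{\ell+1}^t (|\mathbb{T}_\ell^*| \Sigma_\ell^{-1} - \Sigma^{-1}) \Delta M_{\ell+1}}{|\mathbb{T}_\ell^*|}.$$

We first prove that

$$\lim_{n\to+\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \mathcal{T}_n = \frac{4(\pi-1)}{\pi} \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.} \qquad (A.1)$$

As $\mathcal{T}_n$ is a scalar and the trace is commutative, one can rewrite $\mathcal{T}_{n+1}$ as
$\mathcal{T}_{n+1} = \mathrm{tr}(\Sigma^{-1/2} \mathcal{H}_{n+1} \Sigma^{-1/2})$ where

$$\mathcal{H}_{n+1} = \sum_{\ell=1}^{n} \frac{\Delta M_{\ell+1} \Delta M_{\ell+1}^t}{|\mathbb{T}_\ell^*|}.$$

Our goal is to make use of the strong law of large numbers for martingale transforms,
so we start by adding and subtracting a term involving the conditional
expectation of $\Delta \mathcal{H}_{n+1}$ given $\mathcal{F}_n^{\mathcal{O}}$. We have already seen in Section 4.1 that for
all $n$, $\mathbb{E}[\Delta M_{n+1} \Delta M_{n+1}^t | \mathcal{F}_n^{\mathcal{O}}] = \Gamma_n - \Gamma_{n-1}$. Consequently, we can split $\mathcal{H}_{n+1}$
into two terms

$$\mathcal{H}_{n+1} = \sum_{\ell=1}^{n} \frac{\Gamma_\ell - \Gamma_{\ell-1}}{|\mathbb{T}_\ell^*|} + \mathcal{K}_{n+1},$$

where

$$\mathcal{K}_{n+1} = \sum_{\ell=1}^{n} \frac{\Delta M_{\ell+1} \Delta M_{\ell+1}^t - (\Gamma_\ell - \Gamma_{\ell-1})}{|\mathbb{T}_\ell^*|}.$$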
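On the one hand, it follows from Corollary 4.3 and Lemma 2.1 that

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{\Gamma_n - \Gamma_{n-1}}{|\mathbb{T}_n^*|} = \frac{\pi-1}{\pi} \Gamma \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

Thus, Cesaro convergence and the remark that $\{|\mathbb{G}_\ell^*| = 0\} \subset \{|\mathbb{G}_n^*| = 0\}$ for all
$\ell \le n$ yield

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \sum_{\ell=1}^{n} \frac{\Gamma_\ell - \Gamma_{\ell-1}}{|\mathbb{T}_\ell^*|} = \frac{\pi-1}{\pi} \Gamma \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.}$$

On the other hand, the sequence $(\mathcal{K}_n)$ is obviously a matrix martingale transform
and tedious but straightforward calculations, together with Lemmas 6.1
and 6.2 and the strong law of large numbers for martingale transforms given in
Theorem 1.3.24 of [6] imply that $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \mathcal{K}_n = o(n)$ a.s. Hence, we infer from
the equation above that

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \mathcal{H}_n = \frac{\pi-1}{\pi} \Gamma \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.} \qquad (A.2)$$

Finally, we obtain

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \mathcal{T}_n = \frac{\pi-1}{\pi} \mathrm{tr}(\Sigma^{-1/2} \Gamma \Sigma^{-1/2}) \mathbf{1}_{\overline{\mathcal{E}}} = \frac{\pi-1}{\pi} 4 \sigma^2 \mathbf{1}_{\overline{\mathcal{E}}} \quad \text{a.s.},$$

which proves (A.1). We now turn to the asymptotic behavior of $\mathcal{R}_{n+1}$. We know
from Proposition 4.2 that $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} (|\mathbb{T}_n^*| \Sigma_n^{-1} - \Sigma^{-1})$ goes to 0 as $n$ goes to
infinity. Hence, for all positive $\epsilon$ and for large enough $n$, one has

$$\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} |\Delta M_{n+1}^t (|\mathbb{T}_n^*| \Sigma_n^{-1} - \Sigma^{-1}) \Delta M_{n+1}| \le \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} 4 \epsilon \Delta M_{n+1}^t \Delta M_{n+1}.$$

Using again that $\{|\mathbb{G}_\ell^*| = 0\} \subset \{|\mathbb{G}_{n+1}^*| = 0\}$ for all $\ell \le n+1$, we rewrite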

, ^ AM*^,(|T||S7^-S-^)AM,+i 
^{|G;+il>o}/<-«+i-JL{|G;+j>o}2^Jt{|Gj|>o} |;jj;5r| • 



Hence, 



l{|G:+il>0}l^n+l| < 4el{|G;^j>o}^l{|G-|>0} 



it: I 



" AM|+iAM^ 
- 4el{|G;+J>o} 2^ 

< 4el{|G;^j>o}t''(^n+i). 

This last inequality holding for any positive $\epsilon$ and large enough $n$, the limit
given by Equation (A.2) entails that

$$\lim_{n\to\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \mathcal{R}_n = 0 \quad \text{a.s.},$$

which completes the proof of Lemma A.1. □
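
Lemma A.2 Under assumptions (HN.1), (HN.2), (HO) and (HI), we have

$$\lim_{n\to+\infty} \mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \frac{1}{n} \mathcal{B}_n = 0 \quad \text{a.s.}$$

Proof: The result is obvious on the extinction set $\mathcal{E}$. Now let us work on $\overline{\mathcal{E}}$.
For $i \in \{0,1\}$ and $n \ge 1$, let $\xi_n^i = (\varepsilon_{2^n+i}, \varepsilon_{2^n+2+i}, \ldots, \varepsilon_{2^{n+1}-2+i})^t$ be the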



imsart-ejs ver. 2009/08/13 file: EJS1106-014-revised.tex date: September 28, 2011 



B. de Saporta, A. Gegout-Petit, L. Marsalle/Estimation for missing data BAR 39 

collection of $\varepsilon_k$, $k \in \mathbb{G}_n^i$, and set $\xi_n = ((\xi_n^0)^t, (\xi_n^1)^t)^t$. Note that $\xi_n$ is a column vector
of size $2^n$. With these notations, one has

n n 

t=i 1=1 
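The sequence $(\mathcal{B}_n)$ is a real martingale transform satisfying

$$\Delta \mathcal{B}_{n+1} = \mathcal{B}_{n+1} - \mathcal{B}_n = 2 M_n^t \Sigma_n^{-1} \Psi_n \xi_{n+1}.$$

Consequently, via the strong law of large numbers for martingale transforms,
we find that either $(\mathcal{B}_n)$ converges a.s. or $\mathcal{B}_n = o(\langle \mathcal{B} \rangle_n)$ a.s. where

$$\langle \mathcal{B} \rangle_{n+1} = 4 \sum_{\ell=1}^{n} M_\ell^t \Sigma_\ell^{-1} \Psi_\ell C \Psi_\ell^t \Sigma_\ell^{-1} M_\ell,$$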

with 

P 

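As $C$ is positive definite under assumption (HN.1), one has $C \le 2\sigma^2 I_{2^{n+1}}$ in
the sense that $2\sigma^2 I_{2^{n+1}} - C$ is positive semidefinite. Hence, one has

$$\langle \mathcal{B} \rangle_{n+1} \le 8 \sigma^2 \sum_{\ell=1}^{n} M_\ell^t \Sigma_\ell^{-1} \Psi_\ell \Psi_\ell^t \Sigma_\ell^{-1} M_\ell.$$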

Now, by definition, one has 

- [ (sir^^ii^insir^ ) ■ 

We now use Lemma B.l of [3] on each entry to obtain 

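as the matrix in that lemma is positive definite. Therefore, we obtain that

$$\langle \mathcal{B} \rangle_{n+1} \le 8 \sigma^2 \sum_{\ell=1}^{n} M_\ell^t (\Sigma_{\ell-1}^{-1} - \Sigma_\ell^{-1}) M_\ell = 8 \sigma^2 \mathcal{A}_n.$$

Finally, we deduce from the main decomposition given by Equation (7.1) and
Lemma A.1 that

$$\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} (V_{n+1} + \mathcal{A}_n) = o(\mathcal{A}_n) + O(n) \quad \text{a.s.},$$

leading to $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} V_{n+1} = O(n)$ and $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \mathcal{A}_n = O(n)$ a.s. as $V_{n+1}$ and $\mathcal{A}_n$
are non-negative. This implies in turn that $\mathbf{1}_{\{|\mathbb{G}_n^*|>0\}} \mathcal{B}_n = o(n)$ a.s., completing
the proof of Lemma A.2. □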
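
Appendix B: Wei's Lemma

In order to prove Proposition 7.2, we shall apply Wei's Lemma given in [16]
page 1672, to each entry of the vector-valued main martingale

$$M_n = \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_{\ell-1}} (\delta_{2k} \varepsilon_{2k}, \delta_{2k} X_k \varepsilon_{2k}, \delta_{2k+1} \varepsilon_{2k+1}, \delta_{2k+1} X_k \varepsilon_{2k+1})^t.$$

For $i \in \{0,1\}$, denote

$$P_n^i = \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_{\ell-1}} \delta_{2k+i} \varepsilon_{2k+i} \quad \text{and} \quad Q_n^i = \sum_{\ell=1}^{n} \sum_{k \in \mathbb{G}_{\ell-1}} \delta_{2k+i} X_k \varepsilon_{2k+i}.$$

On the set $\overline{\mathcal{E}}$, these processes can be rewritten as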

n n 

1=1 1=1 

where 

On the one hand, we clearly have E[uj,^JJ"^] = and '¥\(v\^^'^\T^\ = o-^^Jp 
a.s. on E. Moreover, it follows from Cauchy-Schwarz inequality that 



< 3C1{|g;|>o} sup W2k+t\^n\ a.s. 
fceG„ 

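as Z^^]^ |G* 1^"'" is bounded. This implies that $\sup E[(v_{n+1}^i)^4 \mid \mathcal{F}_n^{\mathcal{O}}] < +\infty$ a.s.
Consequently, we deduce from Wei's Lemma that for all $\eta > 1/2$,

$$\mathbf{1}_{\{|\mathbb{G}_{n-1}^*|>0\}} \|P_n^i\|^2 = o(|\mathbb{T}_{n-1}^*| n^{\eta}) \quad \text{a.s.}$$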
On the other hand, it is not hard to see that $E[w_{n+1}^i \mid \mathcal{F}_n^{\mathcal{O}}] = 0$ a.s. Moreover, it
follows from the Cauchy-Schwarz inequality that



< ^{|g;i>o} 



^fceG„ peGn k^p 

sup E[4 Jj-f]") ^^f^+^^i] 



which is finite from Proposition 6.5. We deduce from Wei's Lemma applied to
$Q_n^i$ that for all $\eta > 1/2$, $\mathbf{1}_{\{|\mathbb{G}_{n-1}^*|>0\}} \|Q_n^i\|^2 = o(|\mathbb{T}_{n-1}^*| n^{\eta})$ a.s. which completes
the proof of Proposition 7.2. □
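
To make the sums $P_n^i$ and $Q_n^i$ of this appendix concrete, the following Python sketch computes them on simulated data, using the labelling in which the offspring of cell $k$ are $2k$ and $2k+1$. It is a minimal illustration under assumptions that are not taken from the paper: the function names `simulate_bar` and `p_and_q`, the coefficients `a`, `b`, the Gaussian noise, and the observation rule (a daughter is observed with probability `p_obs`, and only if her mother is observed) are all hypothetical simplifications of the missing data mechanism.

```python
import numpy as np


def simulate_bar(n_gen, a=(0.2, 0.5), b=(0.4, -0.3), sigma=1.0, p_obs=0.9, seed=0):
    """Simulate X_k, eps_k and delta_k for cells k = 1, ..., 2^(n_gen+1) - 1.

    Hypothetical model: X_{2k+i} = a[i] + b[i] * X_k + eps_{2k+i}, and
    delta_{2k+i} = delta_k * Bernoulli(p_obs). Index 0 is unused.
    """
    rng = np.random.default_rng(seed)
    size = 2 ** (n_gen + 1)
    x = np.zeros(size)
    eps = np.zeros(size)
    delta = np.zeros(size, dtype=int)
    x[1] = rng.normal()
    delta[1] = 1
    for k in range(1, size // 2):
        for i in (0, 1):
            child = 2 * k + i
            eps[child] = sigma * rng.normal()
            x[child] = a[i] + b[i] * x[k] + eps[child]
            delta[child] = delta[k] * int(rng.random() < p_obs)
    return x, eps, delta


def p_and_q(x, eps, delta, n, i):
    """Compute P_n^i and Q_n^i as sums over mothers k in generations 0, ..., n-1."""
    mothers = np.arange(1, 2 ** n)  # union of G_0, ..., G_{n-1} is {1, ..., 2^n - 1}
    children = 2 * mothers + i
    p = np.sum(delta[children] * eps[children])
    q = np.sum(delta[children] * x[mothers] * eps[children])
    return p, q


x, eps, delta = simulate_bar(n_gen=5)
p1, q1 = p_and_q(x, eps, delta, n=5, i=1)
print(p1, q1)
```

The double sum over $\ell$ and $k \in \mathbb{G}_{\ell-1}$ is collapsed into a single sum over the mothers $1, \ldots, 2^n - 1$, which is the same index set; `p_and_q` therefore requires `n` to be at most the number of simulated generations.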



References 



[1] Krishna B. Athreya and Peter E. Ney. Branching processes. Springer-Verlag,
New York, 1972. Die Grundlehren der mathematischen Wissenschaften, Band 196.

[2] Vincent Bansaye. Proliferating parasites in dividing cells: Kimmel's
branching model revisited. Ann. Appl. Probab., 18(3):967-996, 2008.

[3] Bernard Bercu, Benoîte de Saporta, and Anne Gégout-Petit. Asymptotic
analysis for bifurcating autoregressive processes via a martingale approach.
Electron. J. Probab., 14:no. 87, 2492-2526, 2009.

[4] R. Cowan and R. G. Staudte. The bifurcating autoregressive model in cell
lineage studies. Biometrics, 42:769-783, 1986.

[5] J.-F. Delmas and L. Marsalle. Detection of cellular aging in a
Galton-Watson process. Stoch. Process. Appl., 120:2495-2519, 2010.

[6] Marie Duflo. Random iterative models, volume 34 of Applications of
Mathematics. Springer-Verlag, Berlin, 1997.

[7] Steven N. Evans and David Steinsaltz. Damage segregation at fissioning
may increase growth rates: A superprocess model. Theoretical Population
Biology, 71(4):473-490, 2007.

[8] Julien Guyon. Limit theorems for bifurcating Markov chains. Application
to the detection of cellular aging. Ann. Appl. Probab., 17(5-6):1538-1569,
2007.

[9] Julien Guyon, Ariane Bize, Gregory Paul, Eric Stewart, Jean-François
Delmas, and François Taddei. Statistical study of cellular aging. In CEMRACS
2004 - mathematics and applications to biology and medicine, volume 14 of
ESAIM Proc., pages 100-114 (electronic). EDP Sci., Les Ulis, 2005.

[10] Patsy Haccou, Peter Jagers, and Vladimir A. Vatutin. Branching processes:
variation, growth, and extinction of populations. Cambridge Studies in
Adaptive Dynamics. Cambridge University Press, Cambridge, 2007.

[11] James D. Hamilton. Time series analysis. Princeton University Press,
Princeton, NJ, 1994.

[12] Theodore E. Harris. The theory of branching processes. Die Grundlehren
der Mathematischen Wissenschaften, Bd. 119. Springer-Verlag, Berlin,
1963.

[13] Roger A. Horn and Charles R. Johnson. Matrix analysis. Cambridge
University Press, Cambridge, 1990. Corrected reprint of the 1985 original.

[14] F. Maaouia and A. Touati. Identification of multitype branching processes.
Ann. Statist., 33(6):2655-2694, 2005.

[15] E. J. Stewart, R. Madden, G. Paul, and F. Taddei. Aging and death in an
organism that reproduces by morphologically symmetric division. PLoS
Biol., 3(2):e45, 2005.

[16] C. Z. Wei. Adaptive prediction by least squares predictors in stochastic
regression models with applications to time series. Ann. Statist.,
15(4):1667-1682, 1987.






